Serial No.: Unassigned                   Art Unit: 1648

Filed: herewith                          Docket: 9606Z-IY

For: NON-PATHOGENIC STRAINS              Dated: September 3, 1998
     OF HIV-1

Assistant Commissioner for Patents
Washington, DC 20231

PRELIMINARY AMENDMENT 

Sir: 

Prior to examination, please amend the above-identified 
patent application as follows: 
IN THE SPECIFICATION 

Please replace the paragraph under "CROSS REFERENCE TO 
RELATED APPLICATION" to read as follows: 

--The present application is a continuation application 
of Serial Number 08/477,464 filed on June 7, 1995 which is a 
continuation-in-part of application Serial No. 08/388,353 filed 
on February 14, 1995.--

Please renumber pages 212 to 284 as pages 323-395.
I hereby certify that this correspondence is being
deposited with the United States Postal Service "Express Mail
Post Office to Addressee" service under 37 C.F.R. §1.10 on the
date indicated above and is addressed to the Assistant
Commissioner of Patents and Trademarks, Washington, D.C. 20231.

Dated: September 3, 1998 



Sren DeSalv^ 




REMARKS 

The present application is a continuation application 
under 37 C.F.R. 1.53(b). 

Applicants have amended the specification to include 
cross-reference to the relevant chain of applications and have 
renumbered certain pages of the specification. 



It is respectfully requested that this Preliminary
Amendment be entered in this application prior to examination.
Early and favorable allowance is requested.
NON-PATHOGENIC STRAINS OF HIV-1

CROSS REFERENCE TO RELATED APPLICATION
This application is a continuation-in-part application of US Serial No. 388,353 filed on
14 February, 1995.

FIELD OF THE INVENTION 

The present invention relates to non-pathogenic strains of HIV-1 and to components,
parts, fragments and derivatives thereof and to genetic sequences derived therefrom and
their use in the development of diagnostic and therapeutic agents for the treatment
and prophylaxis of AIDS and AIDS-related disorders. The present invention also relates
to a method for attenuating pathogenic strains of HIV-1 by mutagenizing particular
regions of the HIV-1 genome. A particularly useful aspect of the present invention is
a method for determining the likelihood or otherwise of an individual who is
seropositive for HIV-1 developing AIDS or AIDS-related symptoms. Another aspect
of the present invention is directed to strains of HIV-1 capable of synthesizing a
modified Nef protein or having a wild-type Nef [illegible]
rendering those strains of HIV-1 substantially non-pathogenic.

BACKGROUND OF THE INVENTION 

Bibliographic details of the publications referred to in this specification are collected at
the end of the description. Sequence Identity Numbers (SEQ ID NOs.) for the
nucleotide and amino acid sequences referred to in the specification are defined
following the bibliography.

Throughout this specification, unless the context requires otherwise, the word
"comprise", or variations such as "comprises" or "comprising", will be understood to
imply the inclusion of a stated element or integer or group of elements or integers but
not the exclusion of any other element or integer or group of elements or integers.
Genomic nucleotide sequences of HIV-1 strains referred to herein are represented by
their corresponding DNA sequence.

Exemplary viral isolates referred to herein as "C18" and "C98" were deposited at the
PHLS Centre for Applied Microbiology and Research, European Collection of Animal
Cell Cultures (ECACC), Division of Biologics, Porton Down, Salisbury, Wiltshire SP4
0JG. C18 was deposited on 17 October, 1994 under Provisional Accession Number
V94101706 and C98 was deposited on 31 October, 1994 under Provisional Accession
Number V941031169. Viral isolate "C54" was deposited at ECACC on 10 March, 1995
under Provisional Accession Number V95031022.

A summary of particular deletion mutants of HIV-1 of the present invention referred to
herein is given in Figure 11.

Acquired Immune Deficiency Syndrome (AIDS) and AIDS related disorders are the
clinical result of infection by Human Immunodeficiency Virus type 1 (HIV-1) (Barre-
Sinoussi et al, 1983). Infection by HIV-1 is generally characterised by progressive
immune system damage (Teeuwsen et al, 1990; Clerici et al, 1989) leading to
opportunistic infections, malignancies or wasting syndrome that constitute clinically-
defined AIDS (Busch et al, 1991; Kaslow et al, 1990).

The high mortality rate of individuals infected with HIV-1 together with the social and
economic consequences of the continuing HIV-1 epidemic has created an urgent need
for a safe and effective treatment and/or prophylaxis against the devastating effects of
AIDS. However, despite over a decade of high level scientific research into the
pathogenesis of HIV-1 and the clinical manifestations of the disease, together with a
detailed molecular analysis of the virus, there has been little success in the development
of an effective vaccine. To date, the most effective therapy is treatment with zidovudine
(AZT) which delays the onset of full blown AIDS and alleviates to some extent the
symptoms of HIV-1 infection. However, AZT is not an innocuous compound and AZT,
metabolic products thereof or impurities therein can cause a number of side effects
which limit long term treatment with the drug. Furthermore, AZT resistant isolates have
been reported during treatment. Clearly, therefore, a need exists to develop alternative
strategies in preventing and treating HIV-1 infection.

The initial phases of HIV-1 infection are summarised by Levy (1993) as involving
attachment, fusion and nucleocapsid entry. These phases have been the traditional foci
in research into development of antiviral strategies. The molecular events at the virus
genomic level have also been the subject of intense scientific research with an aim being
the development of a live attenuated vaccine as a possible approach for the treatment or
prophylaxis of HIV-1 infection.

There is a highly variable rate of progression from initial HIV-1 infection to AIDS which
reflects a rapidly changing pathogen and variable immune response of the host to
infection (Sheppard et al, 1993). With regards to the latter, HIV-1 can be considered
as a heterogeneous group of viruses differing at the genetic level with concomitant
variable pathogenicity. For example, HIV-1 strains can differ in their capacity to kill
cells. Furthermore, it appears that HIV-1 strains evolve in a host after infection and that
the evolution varies depending on the tissues infected by the virus. The major sites in
the genome apparently responsible for biological and pathological variation are the
highly variable envelope region (Cheng-Mayer et al, 1991; Shioda et al, 1992; Hwang
et al, 1991; Sullivan et al, 1993; Groenink et al, 1993) and the viral regulatory regions
such as tat (Leguern et al, 1993). The genetic complexity of the HIV-1 group of viruses,
together with their variable pathogenicity, are major difficulties in the development of
live vaccines, genetic vaccines or component vaccines.

Notwithstanding the highly pathogenic nature of HIV-1, there are some reports of long
term survival of subjects infected with the virus (Learmont et al, 1992; Levy, 1993;
Sheppard et al, 1993; Lifson et al, 1991). It is not always clear, however, whether a
benign course following HIV-1 infection is due to host factors, viral factors or other
unknown factors. There are reports that most infected people have at least laboratory
evidence of progressive immune system damage in the form of CD4+ cell loss (Lang
et al, 1989) and defective immune responses (Clerici et al, 1989).
Although simian monkeys have been used as an in vivo model for HIV and Simian
Immunodeficiency Virus (SIV) infection, a major handicap in AIDS research has been
the absence of suitable in vivo models to study the pathogenesis of the disease and, in
particular, to study the viruses involved in benign infection. The need for a suitable in
vivo model is heightened by the fact that results obtained in vitro cannot necessarily be
extrapolated to what occurs in vivo. This was clearly observed by Mosier et al (1993)
where conflicting results were obtained in animals compared to cell cultures.

Despite the absence of suitable in vivo models, considerable scientific research has been
directed to attenuating HIV-1 strains by mutagenesis of the virus genome. Deletions in
the nef gene have been implicated in attenuated strains of SIV and their [illegible]
protective effects in monkeys (Daniel et al, 1992). However, there are conflicting
reports on the possible negative influence the nef gene product has on the rate or extent
of virus replication (Terwilliger et al, 1986; Luciw et al, 1987; Niederman et al, 1989;
Kim et al, 1989; Hammes et al, 1989). In fact, Kim et al (1989) found that nef did not
affect HIV-1 replication or HIV-1 long terminal repeat (LTR)-driven CAT expression.
[Reference illegible] found that the nef gene is required for full pathogenic potential
in SIV. However, such is the complexity of the HIV-1 group of viruses and the
variability of immune responses between individuals let alone different species, that it
is far from clear whether nef deleted strains of HIV-1 would behave similarly to nef
deleted strains of SIV-1. There is a need, therefore, in order to investigate the [illegible]
of deleted HIV-1 strain as a vaccine candidate, to identify individuals infected with
such modified viruses.

Learmont et al (1992) reported that a cohort of five persons infected [illegible]
from a single HIV-1 infected donor have remained asymptomatic from up to about 10-
14 years after infection. Subsequently, a sixth person has been identified as being part
of the cohort. Both the donor and recipients were HIV-1 seropositive but with no
indications of clinical symptoms of HIV-1 related disease and CD4+ cell number and
[illegible] have remained in the normal range. The identification of this
cohort of benignly infected individuals provides a unique in vivo model in which the
pathogenesis of HIV-1 infection can be studied at the clinical and molecular biological
levels.

However, it has not always been possible using conventional isolation procedures to
routinely and reproducibly isolate viral strains from the above mentioned donor or
recipients, which has frustrated efforts to investigate the cause of the asymptomatic
status of these individuals. In accordance with the present invention, methods have now
been established to isolate viruses from the above individuals. It has been determined, in
accordance with the present invention, that individuals of the cohort are infected by
non-pathogenic strains of HIV-1. Furthermore, the non-pathogenic strains of HIV-1 carry
one or more nucleotide mutations. The non-pathogenic strains of the present invention
enable the generation of a range of therapeutic, diagnostic and targeting agents against
HIV-1 infection. The present invention also enables the attenuation of previously
pathogenic strains of HIV-1. Additionally, an investigation of the immunological
profiles of cohort individuals has revealed that a non-pathogenic strain of HIV-1 is
indicated by a particular deletion in the coding region of a protein resulting in an altered
immunological profile for the expressed protein. An example of an altered
immunological profile results from a deletion of certain amino acids in the Nef protein.

SUMMARY OF THE INVENTION
One aspect of the invention is directed to an isolated HIV-1 strain or a component, part,
fragment or derivative thereof which is substantially non-pathogenic.

Another aspect of the invention is directed to an isolated strain of HIV-1 or a biological
source thereof, said HIV-1 having the following characteristics:
(i)  is substantially non-pathogenic in human subjects; and
(ii) carries a modified nef gene which encodes a nef gene product
     substantially immunologically non-interactive with antibodies to amino
     acids 162 to 177 of Nef in wild-type HIV-1.

Yet another aspect of the invention is directed to an isolated non-pathogenic strain of
HIV-1 comprising a genome which is substantially incapable of hybridizing under
medium stringent conditions to a nucleic acid molecule comprising a sequence of
nucleotides which encodes all or part of amino acids 162 to 177 of wild-type HIV-1
Nef.



Still another aspect of the invention provides a non-pathogenic HIV-1 isolate which:
(i)   induces an immune response in a human or primate subject;
(ii)  does not substantially reduce a proliferative response or cytokine
      production to a mitogen, alloantigen and/or recall antigen relative to a
      healthy, non-infected subject; and
(iii) is substantially incapable of inducing an antibody response to amino acids
      162 to 177 of wild-type HIV-1 Nef protein.



Still yet another aspect of the invention contemplates a viral isolate which:
(i)   is interactive to antibodies to a glycoprotein from HIV-1 selected from
      gp41-45, gp120 and gp160;
(ii)  is substantially non-pathogenic in human subjects; and
(iii) carries a deletion mutation of at least ten nucleotides in a region
      corresponding to all or part of amino acids 162 to 177 encoded by the
      nef gene of a pathogenic strain of HIV-1.
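
As a hedged illustration of the deletion criterion above, the following Python sketch converts an amino acid range into 1-based nucleotide coordinates within a coding sequence (three nucleotides per codon) and counts how many nucleotides of a deletion fall inside that range. The function names and the example deletion coordinates are hypothetical, and reading the criterion as "at least ten deleted nucleotides inside the region" is an assumption made only for this example.

```python
def codon_range_to_nt(first_aa, last_aa):
    """Return the 1-based inclusive nucleotide span of codons first_aa..last_aa."""
    return 3 * (first_aa - 1) + 1, 3 * last_aa


def deleted_within(region, deletion):
    """Count deleted nucleotides that fall inside the region (inclusive spans)."""
    start = max(region[0], deletion[0])
    end = min(region[1], deletion[1])
    return max(0, end - start + 1)


# Amino acids 162 to 177 occupy 16 codons, i.e. 48 nucleotides of the coding sequence.
region = codon_range_to_nt(162, 177)

# Illustrative coordinates only; not taken from any isolate in the specification.
hypothetical_deletion = (500, 540)

overlap = deleted_within(region, hypothetical_deletion)
print(region, overlap, overlap >= 10)
```

Coordinates here are relative to the first nucleotide of the coding sequence, not to the proviral genome numbering used in the figures.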



Another aspect of the invention provides an isolated strain of HIV-1 which is reactive
to antibodies to a glycoprotein of HIV-1, is capable of inducing an immune response to
at least one of gag, pol and/or env and which is incapable of directing synthesis of a nef
gene product or a full length nef gene product.



A further aspect of the invention contemplates a method for inhibiting or reducing
productive infection of an individual by a pathogenic strain of HIV-1, said method
comprising administering to a subject a non-pathogenic isolate of HIV-1 in an amount
effective to infect target cells and to generate target cells carrying DNA derived from
said non-pathogenic HIV-1.



In yet another aspect of the invention there is contemplated a method for vaccinating an
individual against the development of AIDS or AIDS related diseases, said method
comprising administering to said individual a non-pathogenic isolate of HIV-1 in an
amount effective to infect target cells and to generate target cells carrying DNA derived
from said non-pathogenic HIV-1.

In still yet another aspect of the invention there is provided a method for obtaining a
preparation of non-pathogenic HIV-1 from a biological sample, said method comprising
co-culturing PBMCs from said biological sample from an individual putatively infected
with said non-pathogenic HIV-1 with HIV-1 seronegative donor PBMCs depleted for
CD8+ cells; harvesting the PBMCs and supernatant fluid every from about 5 to about
10 days and adding fresh medium with CD8+ depleted PBMCs with said fresh medium
and isolating said virus from the supernatant fluid.

Another aspect of the invention contemplates a method for obtaining a preparation of
non-pathogenic HIV-1 from a biological sample, said method comprising co-culturing
monocytes from said biological sample from an individual putatively infected with said
non-pathogenic HIV-1 with HIV-1 seronegative donor PBMCs depleted for CD8+ cells,
harvesting the monocytes and PBMCs and supernatant fluid every from about 5 to about
10 days and adding fresh medium with CD8+ depleted PBMCs with said fresh medium
and isolating said virus from the supernatant fluid.

In yet a further aspect of the invention there is contemplated a method for identifying
or screening for compounds capable of reducing or otherwise interfering with HIV-1
replication, said method comprising contacting a compound to be tested with a cell or
cell extract containing or capable of containing a nef gene product fused to a reporter
molecule capable of giving an identifiable signal and screening for a compound which
inhibits said signal.
In still yet a further aspect of the invention there is provided a viral isolate which:

(i)   is genetically or immunologically related to a pathogenic strain of HIV-1;
(ii)  is substantially non-pathogenic in human subjects;
(iii) comprises a first nucleotide sequence constituting its genome which is
      capable of hybridising under medium stringency conditions to SEQ ID
      NO: 1 or a complementary form thereof; and
(iv)  comprises a second nucleotide sequence within said first nucleotide
      sequence and which second nucleotide sequence directs expression of a
      mRNA molecule capable of inhibiting, reducing or otherwise down-
      regulating translation of a protein or polypeptide encoded by a pathogenic
      strain of HIV-1 or inhibit, reduce or otherwise down regulate operation
      of a non-protein encoding region of a pathogenic strain of HIV-1.

Still yet another aspect of the invention is directed to a method for determining the
pathogenicity of an HIV-1 strain after said HIV-1 strain infects cells of an individual,
said method comprising determining the presence of a deletion mutation in the genome
of said HIV-1 wherein said deletion mutation results in said genome being unable to
synthesise a polypeptide or protein from a pathogenic strain of HIV-1 or directing the
synthesis of a truncated form of said polypeptide or protein wherein the presence of
such a mutation is indicative of the presence of a non-pathogenic strain of HIV-1.

A further aspect of the invention contemplates a method for determining the
pathogenicity of a strain of HIV-1 after said HIV-1 strain infects cells of an individual,
said method comprising contacting a biological sample from said individual with a
peptide corresponding to a deleted or truncated region of an HIV-1-derived protein and
screening for the absence of antibody binding to said peptide, wherein the absence of
antibody binding is indicative of a deletion or truncation in that protein and further
indicative of the non-pathogenicity of said strain of HIV-1.
Another aspect of the invention is directed to a method for determining the pathogenicity
of a strain of HIV-1 after said HIV-1 strain infects cells of an individual, said method
comprising contacting a biological sample from said individual with an effective amount
of a peptide having an amino acid sequence comprising or within amino acids 162-177
of wild-type HIV-1NL43 Nef, said contact being for a time and under conditions
sufficient for an antibody if present in said biological sample to form a complex with
said peptide and then detecting the presence of said complex wherein the absence of a
complex in an individual seropositive for HIV-1 is indicative of that individual being
infected with a non-pathogenic strain of HIV-1.

Still another aspect of the invention provides a method for determining the pathogenicity
of an HIV-1 strain after said HIV-1 strain infects cells of an individual, said method
comprising determining the presence of a mutation in the genome of said HIV-1 wherein
the presence of the mutation results in a Nef protein substantially lacking amino acids
162 to 177 of Nef of a wild-type strain of HIV-1 and wherein said mutation is indicative
of a non-pathogenic strain of HIV-1.

Still yet another aspect of the invention is directed to a synthetic peptide comprising a
sequence of amino acids defined by SEQ ID NO:801 or a part or fragment thereof.

In yet a further aspect of the invention there is contemplated a method for determining
the risk of an individual seropositive for HIV-1 developing symptoms of AIDS, said
method comprising contacting antibodies from said patient with a synthetic peptide
defined by SEQ ID NO:801 or a part or derivative thereof and detecting non-binding of
antibodies to said peptide wherein the substantial absence of antibodies to said peptide
is indicative of a low risk of the individual developing AIDS.



BRIEF DESCRIPTION OF THE FIGURES

Figure 1 is a representation showing the alignment of the nucleotide sequences from
donor D36 peripheral blood mononuclear cell (PBMC) [D36P] and non-pathogenic HIV-1
from recipient C18 HIVStV [C18S], C18 HIVMBC [C18M] and C98 HIV [C98H] and
C54 PBMC [C54P] with the equivalent region of HIV-1NL43. Sequences labelled PBMC
are from patient PBMC; those labelled HIV are from virus isolated from patient PBMC
and grown in culture. Numbering for HIV-1NL43 is as per Myers et al (1992; 1994)
where nucleotide 1 is the first nucleotide of the complete proviral DNA sequence.
D36P, C18S, C18M, C98H and C54P are numbered from the start of the region
sequenced. Identity with NL43 sequence is shown by (*). Deleted nucleotides are
shown by (-). Spaces introduced to maximize alignment are shown by ( ). Features in
HIV-1NL43 are marked by overlining the sequence; features in D36 and C18 are marked
by underlining the sequence.

Figure 2 shows the alignment of encoded amino acid sequences of (a) tat exon 3 and
(b) rev exon 3 from HIV-1NL43, D36 PBMC, C18 HIV [subscript illegible] and C98 HIV.
In-phase termination codons (*) and NL43 encoded amino acid numbers are shown.

Figure 3 is a representation showing the alignment of C-terminal envelope glycoprotein
gp41 amino acid sequences encoded by D36 PBMC, C18 HIVStV, C18 HIVMBC and
C98 HIV. Numbering is that of the amino acid sequence of the mature envelope
glycoprotein of HIV-1NL43. Termination is shown by (#).

Figure 4 is a representation showing alignment of amino acid sequences encoded by the
nef genes of HIV-1NL43, D36 PBMC, C18 HIVStV, C18 HIVMBC and C98 HIV. In-
phase termination codons are shown by (#). Identical amino acids are shown by (*).
Residues underlined are those immediately before a deletion.

Figure 5 shows a duplication of NFκB and Sp1 sequences in D36 PBMC, C18 HIVStV
and C18 HIVMBC as demonstrated by alignment of their low homology region
sequences with the NFκB-Sp1 region of HIV-1NL43. Nucleotide numbering according
to Figure 1. Identity with NL43 sequence shown by (*) and NFκB and Sp1 sites in
NL43 overlined. Position of nef/LTR region sequence deletion shown by (a).

Figure 6 is a graphical representation showing replication of C18 and C98 viral isolates
and D36 PBMC from asymptomatic patients in PHA-stimulated PBMCs.

Figure 7 is a graphical representation showing replication of viral isolates from
asymptomatic patients in non-PHA stimulated PBMCs.

228200 is an Australian isolate of HIV-1 described by [illegible], R. et al (1990). Its
characteristics include being T cell trophic, with fast [illegible]
and/or SI phenotype.

Figure 8 is a graphical representation of cell surface receptor expression for syncytia-
inducing (SI), non-syncytia-inducing (NSI) and asymptomatic patient isolates.

Key: Tm; IL-2R; CD4

228200 is defined in the legend to Figure 7. 243925 is a viral isolate of HIV-1 which
is monocyte/macrophage trophic and exhibits NSI phenotype (Dr [name illegible],
PhD thesis entitled "Viral determinants of HIV-1 syncytium formation", the University
of Melbourne, Parkville, Victoria, Australia).
Figure 9 is a representation of nucleotide sequence of C18 HIV-1MBC (SEQ ID NO:
800).

Figure 10(a) - (g) are graphical representations showing clinical immunology of cohort;
(a) CD3; (b)(i) CD4 (ii) CD4%; (c)(i) CD8; (ii) CD8%; (d) lymphocyte count; (e)
CD4/CD8 ratio; (f) β-2-microglobulin; and (g) Kaplan-Meier estimates of time to
disease progression (Cox & Oakes, 1989).

Figure 11 is a schematic representation of the deletion mutants of the present invention.

Figure 12 shows reactivity of sera from LTP individuals (1a); HIV-1-ve individuals
(1bi, 1bii); individuals with autoimmune disease (A/HIV-1) (1biii); LTNP1 (1c) and
LTNP2 (1d) with full length Nef 27 derived from HIV-1NL43 (referred to herein as "Nef
27"). The term "LTNP" is an abbreviation for "Long Term Non-Progressor". LTNP1
and LTNP2 are defined in Example 16.

Wells of 96-well polystyrene microtitre plates coated with purified Nef 27 (100 ng/well)
were incubated with sera (titrated from 1:100 to 1:10,000) obtained from LTP
individuals, HIV-1-ve individuals, individuals with autoimmune disease, LTNP1 and
LTNP2. The presence of antibodies in the sera which recognise Nef 27 was detected
using a biotin-streptavidin HRP system with o-phenylenediamine as substrate.
Absorbance was measured using a Titertek plate reader at wavelengths of 630 and 450
nm.

Figure 13a shows reactivity of sera from LTP individuals against Nef-derived peptides.
Synthetic peptides corresponding to amino acid residues 1 to 19 (i), 20 to 36 (ii), 44 to
65 (iii), 72 to 83 (iv), 89 to 97 (v), 109 to 114 (vi), 164 to 186 (vii), 187 to 206
(viii), 121 to 135 (ix) and 162 to 177 (x) of HIV-1NL43 Nef 27 were coated onto wells
of 96-well microtitre plates at a concentration of 500 ng/well. Sera (titrated from 1:300
to 1:100,000) from the LTP individuals were then incubated with the immobilised
peptides and the presence of antibodies in the sera which recognise the Nef-derived
peptides was detected using a biotin-streptavidin HRP system with o-phenylenediamine
as substrate. Absorbance was measured using a Titertek plate reader at wavelengths of
630 and 450 nm.

Figure 13b(i) shows reactivity of sera from HIV-1-ve individuals against Nef-derived
peptides. Synthetic peptides corresponding to amino acid residues 1 to 19 (i), 20 to 36
(ii), 44 to 65 (iii), 72 to 83 (iv), 89 to 97 (v), 109 to 114 (vi), 164 to 186 (vii), 187 to
206 (viii), 121 to 135 (ix) and 162 to 177 (x) of HIV-1NL43 Nef were coated onto wells
of 96-well microtitre plates at a concentration of 500 ng/well. Sera (titrated from 1:300
to 1:100,000) from A/HIV-1-ve individuals [illegible] were then incubated
with the immobilised peptides and the presence of antibodies in the sera which recognise
the Nef-derived peptides was detected using a biotin-streptavidin HRP system with o-
phenylenediamine as substrate. Absorbance was measured using a Titertek plate reader
at wavelengths of 630 and 450 nm.

Figure 13b(ii) shows reactivity of sera from autoimmune A/HIV-1-ve individuals
against Nef-derived peptides. Synthetic peptides corresponding to amino acid residues
1 to 19 (i), 20 to 36 (ii), 44 to 65 (iii), 72 to 83 (iv), 89 to 97 (v), 109 to 114 (vi), 164
to 186 (vii), 187 to 206 (viii), 121 to 135 (ix) and 162 to 177 (x) of HIV-1NL43 Nef
were coated onto wells of 96-well microtitre plates at a concentration of 500 ng/well.
Sera (titrated from 1:300 to 1:100,000) from A/HIV-1-ve individuals with autoimmune
disease were then incubated with the immobilised peptides and the presence of antibodies
in the sera which recognise the Nef-derived peptides was detected using a biotin-
streptavidin HRP system with o-phenylenediamine as substrate. Absorbance was
measured using a Titertek plate reader at wavelengths of 630 and 450 nm.

Figure 13c shows reactivity of sera from LTNP1 individuals against Nef-derived
peptides. Synthetic peptides corresponding to amino acid residues 1 to 19 (i), 20 to 36
(ii), 44 to 65 (iii), 72 to 83 (iv), 89 to 97 (v), 109 to 114 (vi), 164 to 186 (vii), 187 to
206 (viii), 121 to 135 (ix) and 162 to 177 (x) of HIV-1NL43 Nef were coated onto wells
of 96-well microtitre plates at a concentration of 500 ng/well. Sera (titrated from 1:300
to 1:100,000) from the LTNP1 individuals were then incubated with the immobilised
peptides and the presence of antibodies in the sera which recognise the Nef-derived
peptides was detected using a biotin-streptavidin HRP system with o-phenylenediamine
as substrate. Absorbance was measured using a Titertek plate reader at wavelengths of
630 and 450 nm.

Figure 13d shows reactivity of sera from LTNP2 individuals against Nef-derived
peptides. Synthetic peptides corresponding to amino acid residues 1 to 19 (i), 20 to 36
(ii), 44 to 65 (iii), 72 to 83 (iv), 89 to 97 (v), 109 to 114 (vi), 164 to 186 (vii), 187 to
206 (viii), 121 to 135 (ix) and 162 to 177 (x) of HIV-1NL43 Nef were coated onto wells
of 96-well microtitre plates at a concentration of 500 ng/well. Sera (titrated from 1:300
to 1:100,000) from the LTNP2 individuals were then incubated with the immobilised
peptides and the presence of antibodies in the sera which recognise the Nef-derived
peptides was detected using a biotin-streptavidin HRP system with o-phenylenediamine
as substrate. Absorbance was measured using a Titertek plate reader at wavelengths of
630 and 450 nm.
A summary of the SEQ ID Nos. used in the subject specification is shown below:

SEQ ID NO:   DESCRIPTION

1            Nucleotide sequence of HIV-1NL43 genome
2-613        Decanucleotides of nef gene from HIV-1NL43
614          Partial nucleotide sequence of D36 HIV-1 isolate
615          Partial nucleotide sequence of C18 HIV-1MBC isolate
616-625      PCR primers shown in Table 1
626-633      Sequence primers shown in Table 2
634          Amino acid residues 15-27 of HIV-1NL43 Nef protein
635          HIV-1NL43 tat exons (Figure 2)
636          HIV-1 D36 tat exons (Figure 2)
637          HIV-1 C18 tat exons (Figure 2)
638          HIV-1NL43 rev exons (Figure 2)
639          HIV-1 D36 rev exons (Figure 2)
640          HIV-1 C18 rev exons (Figure 2)
641          HIV-1NL43 C-terminal of gp41 (Figure 3)
642          HIV-1 D36 C-terminal of gp41 (Figure 3)
643          HIV-1 C18 C-terminal of gp41 (Figure 3)
644          HIV-1NL43 nef gene (Figure 4)
645          HIV-1 D36 nef gene (Figure 4)
646          HIV-1 C18 nef gene (Figure 4)
647          HIV-1NL43 NFκB/Sp1 sequence (Figure 5)
648          HIV-1 D36 NFκB/Sp1 sequence (Figure 5)
649          HIV-1 C18 NFκB/Sp1 sequence (Figure 5)
650          Nucleotide sequence of nef gene from [illegible]
651          Nucleotide sequence of env and nef regions of HIV-1NL43
652-799      Decanucleotides of LTR region from HIV-1NL43
800          Nucleotide sequence of C18 HIV-1MBC
801          Amino acids 162 to 177 of wild-type HIV-1NL43 Nef
802          Nucleotide sequence encoding amino acids 162 to 177 of wild-
             type HIV-1NL43 Nef
803-841      Decanucleotide deletion of Nef gene covering amino acids 162 to
             177
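
The decanucleotide ranges in the summary above are consistent with overlapping 10-nucleotide windows taken one position apart: a 48-nucleotide stretch (16 codons for amino acids 162 to 177) yields 48 - 10 + 1 = 39 windows, which matches the 39 entries of SEQ ID NOs: 803-841. The following Python sketch illustrates that enumeration under this assumption. The function name and the placeholder sequence are hypothetical and are not taken from the specification.

```python
def decanucleotides(sequence, width=10):
    """Return every overlapping window of `width` nucleotides, one step apart."""
    sequence = sequence.upper()
    if len(sequence) < width:
        return []
    return [sequence[i:i + width] for i in range(len(sequence) - width + 1)]


# Placeholder 48-nucleotide string (16 codons); this is NOT the real nef sequence.
placeholder = "ACGT" * 12

windows = decanucleotides(placeholder)
print(len(windows))  # 39 windows for a 48-nucleotide input
```

By the same arithmetic, the 612 entries of SEQ ID NOs: 2-613 would correspond to a 621-nucleotide stretch, again assuming one-nucleotide steps.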
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

One aspect of the present invention contemplates a non-pathogenic isolate of HIV-1 or
a component, part, fragment or derivative thereof.

In a related embodiment, there is provided a novel isolate of HIV-1 or a component,
part, fragment or derivative thereof wherein said HIV-1 isolate is capable of stimulating
in a human or primate subject an immune response such as a humoral response
to at least one HIV-1 glycoprotein such as but not limited to gp120 and/or
gp160 while not substantially reducing in said human or primate subject proliferative
responses and cytokine production to a mitogen, alloantigen and/or recall antigen
compared to a healthy, non-infected human or primate subject. Preferably, the cytokine
is IL-2. Preferably, the mitogen is ConA or PHA and the recall antigen is [illegible]
tetanus toxoid. Preferably, the HIV-1 isolate is non-pathogenic.

More particulariy, the present invention rehOes tt> an isolated MV-l atnio which: 
0 is suhstantidly noo-paOogenic in human sul^ects; and 
00 canies one or more mutatioos in its genwie resuttiiv ta die ioaUliQf to direcc 
synthesis of at least one patfiogenic HIV-I-derived po^rpeptida or protein. 

Even more particularly, the present invention provides an isolated HIV-1 strain which:
(i) is substantially non-pathogenic in human subjects; and
(ii) carries a mutation in the nef gene and/or a long terminal repeat (LTR) region or in a functionally equivalent location in the HIV-1 genome.

Still even more particularly, the present invention is directed to an isolated virus which:
(i) has a genome which is capable of hybridising under medium stringency conditions to complementary nucleic acid from a pathogenic strain of HIV-1;
(ii) is substantially non-pathogenic in human subjects;
(iii) carries one or more deletion mutations in a region of its genome corresponding to a nef gene in said pathogenic strain of HIV-1; and
(iv) optionally carries a mutation in one or both LTR regions.

In a related embodiment, there is provided an isolated virus which:
(i) has a genome which is capable of hybridising under medium stringency conditions to complementary nucleic acid from a pathogenic strain of HIV-1;
(ii) is substantially non-pathogenic in human subjects;
(iii) carries one or more deletion mutations in an LTR region of its genome; and
(iv) optionally carries a mutation in a region corresponding to a nef gene in said pathogenic strain of HIV-1.

In a further related embodiment, there is provided an isolated virus which:
(i) has a genome which is capable of hybridising under medium stringency conditions to complementary nucleic acid from a pathogenic strain of HIV-1;
(ii) is substantially non-pathogenic in human subjects; and
(iii) carries one or more deletion mutations in a region of its genome corresponding to a region which contains nef coding sequences and LTR nucleotide sequences.

Another aspect of the present invention is directed to an isolated strain of HIV-1 or a biological source thereof, wherein said HIV-1 has the following characteristics:
(i) is substantially non-pathogenic in human subjects; and
(ii) carries a modified nef gene which encodes a Nef product immunologically non-interactive with antibodies to amino acids 162 to 177 of Nef in wild-type HIV-1.

Amino acids 162-177 of wild-type HIV-1 NL4-3 strain (Myers et al, 1994) [hereinafter referred to as "HIV-1 NL4-3"] are as follows:

Thr Ser Leu Leu His Pto Vd Scr Utt Hr s Gly Md Asp Pko ^ 

This aspect of the present invention relates in part to amino acid sequence SEQ ID NO: 801 from HIV-1 NL4-3 or from the functionally equivalent region of other strains of HIV-1.

A further aspect of the present invention contemplates an isolated strain of HIV-1 or a biological source thereof wherein said HIV-1 has the following characteristics:
(i) is substantially non-pathogenic in human subjects; and
(ii) encodes a Nef protein or portion thereof which is interactive with HIV-1 Nef antibodies but which is substantially non-interactive with antibodies to amino acids 162 to 177 of wild-type HIV-1 Nef protein.

Still another aspect of the present invention relates to an isolated strain of HIV-1 or a biological source thereof which is substantially non-pathogenic in human subjects and which is substantially incapable of directing synthesis of a Nef protein or portion thereof comprising amino acids 162 to 177 of wild-type HIV-1 Nef protein.

In still yet another aspect of the present invention, there is provided an isolated strain of HIV-1 or a biological source thereof, said HIV-1 being substantially non-pathogenic in humans and comprising a mutation in its genome corresponding to amino acids 162 to 177 of wild-type HIV-1 Nef such that these amino acids are substantially not represented in a Nef protein or derivative thereof produced by said isolated HIV-1 strain, or insufficient of the amino acid sequence is represented to induce an immune response to that region of Nef.

In a related embodiment, the genomic mutation in the non-pathogenic strain of HIV-1 is a mutation in one or more of nucleotides 9271 to 9317 relative to HIV-1 NL4-3 or in a functionally equivalent region in another pathogenic strain of HIV-1.

In a related embodiment, there is provided a non-pathogenic strain of HIV-1 comprising a genome which is substantially incapable of hybridising under medium stringent conditions to a nucleic acid molecule comprising that sequence of nucleotides which encodes all or part of amino acids 162 to 177 of wild-type HIV-1. Preferably, the nucleic acid molecule is a synthetic oligonucleotide.

In a particularly preferred embodiment, the present invention provides non-pathogenic HIV-1 isolate C18 deposited at the ECACC on 17 October, 1994 under Provisional Accession Number V94101706.

In a related embodiment, the present invention provides non-pathogenic HIV-1 isolate C98 deposited at the ECACC on 31 October, 1994 under Provisional Accession Number V941031169.

In another embodiment, the present invention provides non-pathogenic HIV-1 isolate C54 deposited at ECACC on 10 March, 1995 under Provisional Accession No. V95031022.

Although pathogenicity is a relative term, it is used herein in relation to the capacity of a strain of HIV-1 to induce AIDS or AIDS-related disorders in an individual. Accordingly, a "non-pathogenic" strain of HIV-1 is a strain which, at the clinical level, does not lead to the development of AIDS, at least within the median time of 6-10 years following infection with HIV-1. At the laboratory level, a non-pathogenic strain of HIV-1 is considered not to alter CD4+ cell counts or β2-microglobulin concentrations.

In addition, a non-pathogenic strain of HIV-1 may not alter CD3+ and CD8+ cell counts and would not alter lymphocyte counts. CD4+:CD8+ ratios also remain unchanged relative to normal non-infected individuals. Furthermore, generally a non-pathogenic strain of HIV-1 does not induce p24 antigenaemia. A non-pathogenic HIV-1 of the present invention is generally still infectious but individuals infected with the virus remain free of symptoms for at least 6-10 years after infection.

A laboratory classified non-pathogenic strain of HIV-1 may be determined at any time after infection. The term "non-pathogenic" is not to be considered as a strain that is never pathogenic under any conditions as this will depend on the host individual, the level of immune responsiveness in that individual and the extent or otherwise of other, for example, immune compromising disorders. Accordingly, a "non-pathogenic" HIV-1 isolate of the present invention may also be considered a "low virulent" strain of the virus. A non-pathogenic strain of HIV-1 as contemplated herein may be isolated from an asymptomatic individual or may be derived from a pathogenic strain by mutation. Although the present invention is not to be limited to any particular pathogenic strain of HIV-1, for reference purposes, an example of a pathogenic strain is HIV-1 strain NL4-3 as described by Myers et al (1992; 1994).

The non-pathogenic nature of the HIV-1 of the present invention is conveniently evidenced by the cohort of seven individuals comprising one donor and six recipients which have remained free of symptoms or signs of HIV-1 infection for greater than the median time of 6-10 years. However, the individuals of the cohort are seropositive for HIV-1 following infection with the virus as determined by Western blot analysis and genetic analysis (e.g. using PCR techniques). A seropositive individual is one showing reactivity to at least one HIV-1 glycoprotein (such as but not limited to gp41-45, gp120, gp160) and at least three other virus-specific bands.
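
By way of illustration only, the seropositivity criterion stated above (at least one glycoprotein band plus at least three other virus-specific bands) can be sketched as a simple check. The function name and the non-glycoprotein band names in the example calls are hypothetical and are not taken from this specification; the input is assumed to list only virus-specific reactive bands.

```python
# Glycoprotein band names are taken from the text; every other band name
# used in the example calls below is a hypothetical placeholder.
GLYCOPROTEIN_BANDS = {"gp41-45", "gp120", "gp160"}


def is_seropositive(reactive_bands):
    """Apply the stated rule to a collection of reactive virus-specific bands.

    Returns True when at least one glycoprotein band and at least three
    other virus-specific bands are reactive.
    """
    bands = set(reactive_bands)
    glycoprotein_hits = bands & GLYCOPROTEIN_BANDS
    other_hits = bands - GLYCOPROTEIN_BANDS
    return len(glycoprotein_hits) >= 1 and len(other_hits) >= 3


# One glycoprotein band and three other bands satisfy the rule.
print(is_seropositive(["gp120", "p17", "p24", "p31"]))
# Two glycoprotein bands but only one other band do not.
print(is_seropositive(["gp120", "gp160", "p24"]))
```

Bands are handled as a set so that a band reported twice is counted once.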

In accordance with the present invention, a non-pathogenic HIV-1 isolate is also a strain of HIV-1 which:
(i) induces an immune response in a human or primate subject; and
(ii) does not substantially reduce proliferative responses or cytokine production to a mitogen, alloantigen and/or recall antigen relative to a healthy, non-infected subject.

Preferably, the immune response is to a glycoprotein, for example gp41-45, gp120 and/or gp160. Preferably, the cytokine monitored is an interleukin, such as IL-2. Preferably, the recall antigen is influenza or tetanus toxoid. A non-pathogenic HIV-1 isolate is also one which:
(iii) does not substantially alter proliferative responses or cytokine production to allogeneic mononuclear cells.

Furthermore, a non-pathogenic strain of HIV-1 carries a deletion in an HIV-1-derived protein which results in an altered immunological profile. In a particularly preferred embodiment, the non-pathogenic strain of HIV-1 is substantially incapable of inducing an antibody response to amino acids 162 to 177 of wild-type HIV-1 Nef protein. According to this preferred aspect of the present invention, there is provided a non-pathogenic HIV-1 isolate which:
(i) induces an immune response in a human or primate subject;
(ii) does not substantially reduce proliferative responses or cytokine production to a mitogen, alloantigen and/or recall antigen relative to a healthy, non-infected subject; and
(iii) is substantially incapable of inducing an antibody response to amino acids 162 to 177 of wild-type HIV-1 Nef protein.

The genomes or complementary DNA therefrom of the non-pathogenic HIV-1 isolates of the present invention are capable of hybridising under medium stringency conditions to the corresponding genome or complementary DNA of a pathogenic strain of HIV-1 (e.g. HIV-1 strain NL4-3). The ability to hybridise to a pathogenic strain of HIV-1 only applies to a comparison of the entire genome/complementary DNA of a non-pathogenic strain or a fragment which includes genetic material corresponding to a region in the genome 3' of the nef gene in a pathogenic strain of HIV-1.

Reference herein to "wild-type HIV-1" is meant to include reference to the prototype pathogenic strain HIV-1 NL4-3 (Myers et al, 1992; 1994). For the purpose of reference only, a suitable genomic nucleotide sequence from HIV-1 NL4-3 is set forth in SEQ ID NO: 1 (Myers et al, 1992; 1994):

SI ATCTACCAGACACAAaaCTiUnTCOCTQIkTTQOC^^ 
101 GCCACyXaTaiGATATCCACTgACCTTTQQ&l Sa tf 
151 CAOTTqAACCMAGCAAOTAQM^aVX^ 
201 TTOITACACCCTATaAGCCMKATaaQ^^ 

251 KnhtrscnQahMTrtQidSfjaccs^ 

301 AOCTGCAT00aaiUnACnACIUJbG»kCTaCTaA^^ 
3 SI G»ACTTTCCXIC1X3GGaACTTTCC!AGnaaU^^ 

401 C»3AQTGGOC»<X:CCTCaaAT(XrrACATATAAQ^ ^ I ' m CCTOT 
4S1 ACTCKSGTCTCTCTGCTTiUSACCAGKrCTaJUK^ 
501 ACTAGaOAACCCACraCTTAAa<XTClUk.TAAMX»naCCT^^ 
551 AA6TAOTGTOTCK:COGTCTQTTGTaTGACT<nxxn'AXCTJUS^^
601 AGACCCTTTTAGTCACnX?rGGAAAATCTCTA0aumSGOGmX
651 GACTTGWU^GCGAAAGfTAAAOCCIWJlGX^^ 
701 GCTTGCTGAA<XJG0GK3iaXK3UVG»C^^ 
751 C0CCAAAAATTTTCACTAGCGGAGGCTJU3JUUX»^^ 
5 801 AOCGT<XXnATTAAQCGKXKX3JiaAATTAGATAAATOC3^^ 

851 TAAGOCCA<XWGGJUUVaAAACAATATAAACrrAAAACATATA(nATC^^ 
901 AGaiGGGACK7rAGaUlCX»TT0GCAGTTAATCCr^^ 
951 AOAAGGCTOTAOAaUUlTACIXXSGACAOCTACAAC^ 
1001 GATCAQAAGAAClTAGATCATTATATAATA<3ATAaCAOTC^^ 
10 1051 GlXK»TCAAAGGATAGATOTAAAAGJM3ia»AaQAAOC^^ 

1101 AOASOAAGAGaUUUkCAAAAOTAAGAAAAAOOCACMO^^ 
1151 ACAC»C3GAAACAACAGCCAGCnPCAOCC3UUUlT^^ 
1201 CTCCAGfG<XK3UVATO(nACATCRaOCCATATC»CCt^ 
1251 ATCXXn-AAAAGTAitnaSAAfiAGAAOOCTTTCM^^ 
15 1301 TGnTTTaUXATTATCMAAGWaM^^ 

1351 CTAAACACACnX3G0GXXlACATCAAaC»OOCAT0CAAATOT^ 
1401 CATCAATOAOGAAGCTGCACSAATCXX»TAaATK!^^ 
1451 CXXXrTATTOCACCAGKXXaUjATGAOAaAACCa^^ 
1501 OGAACTACTAGTACCCTTCAaOAACAAATAaaATO^^ 
20 1551 ACCTAT0CCAarA0aJW3AAATCrATAAAAaA«l»^ 

1601 ATAAAATAOTAAGAATQTaiTASCXXBCACXaGCKrTCTqQ^ 
1651 QQACC!AAAOGAACOCTTTAflACACTATGTI^GACCIC W^ 
1701 AAGAaCOGJAOCAAGCTTCACAAaAOOTAAAAAATrGaAl^^ 
1751 TGTT<MriXX3UUUlTGCXaUlC!CCAC»^ 
25 1801 GGACCakOCaAGOaACACrAGAAGAAATaATaACAaCMOTCMCWSACTGGO 

1851 aac»CX:CXXXXATAAAOCAAGAOTTTTGGCTGAAflCAATmflOCAA<rrAA 
1901 CAAATCCMCTACCATAATOATACAOAAAOGKaAITTTAOaAACCAAAaA 
1951 AAGACTG^^AA tflW r i ' ^ ^ :A ATTQTOGCAAA Q AAQQOCACa»A<lCCAAA^ 
2001 TT<K3«JOaCCCCTAOGAAAAAOaO«»TTOGAAAT^^ 
30 2051 ACCAAAT<aUUtf»TTOXACTCUWSlMaACAOOCT^ 

2101 TaaoCrT<XXACAAOOaAA<K3CaU80GAATTTTCTra 
2151 <XXJUlC3UXXXXavCC»aAAOAflAOCTTC3UX^^ 
2201 CTCCCTCrCAaAAGC3USGAOCO(»TAaACAA0GAAClVlAl^^ 
2251 TCCCTCAGATCACTCrTTaXaWGKX^ 
35 2301 GGGCAATTAAAGGAAGCTCTATTAflATACAGW»lClCWATG»^^ 

2351 AGAAOAAATGAATTTGCCAOOAAGATOOAAACaAAAATOATAOaOGOAA 
2401 TTOGAGGTrTTATCAAAGTAGOACAaTATGATCaaATACiaffAaA^^ 
2451 TGCOGACATAAA<3CTATAOOTACawG?rATTAOTAiK»CCTAC^
2 SOI CJlTAATTGGaUiaAAATCTOTTOACrcaUSATTa^^
2551 CCATTAGTCCTATTC»GACTGTACaU?rAAAATTAAA«X^ 
2601 eXXXXaUVAJUnTAAACRATOGCCATTCACA ^ ^ 

2651 A(n*SAAXTTTGTAC3W3AAATGOAAAA(XaAGO*AAAATTTCaAAAAT^ 
5 2701 GGCCTGaUUUirCCATACa^TACrOCAGWATTIWXAW 

2751 AOTACTAAATGGAGAAAATTAGTAGATTTCACSaVGAAC^^ 
2801 TaUM5ATTTCI<W3GAAGTTCAATTA<XJAATACaiCATCC^^ 
2851 AACAGAAAAAATCAOTAACAOTACTOGATOT<XWa»^ 
2901 GrrTCCCTTAOATAAAOACTTCAGGWUnATACTOCATTTA^^ 

10 2951 TATAAACAAT6AG»lCSlCCACK3GATTIU2ATATCJyaiACAAT^ 

3001 AGGGATCSaAAACX»TCACC3«KaUlTATTCCai^^ 
3051 TTAGACKXTTTTTAGAAAACSUUATCCaiGaVCAT^^ 
3101 GC^TOATTTOTATGTAGGATCTGACrTAaAAATllGKWC^^ 
3151 AAATAOAOaAACTSACSaiCAACATCTOrTC^^ 

15 3201 GaiCAAAWUlCATCaaAAAflAACCnXATTCCmOaATCPaBr^^ 

3251 CCATCCKaTAAA«XaACAOTAClU3CCTATJ^^ 
3301 GCTCmCTOTCAATGACATACaaAIUlTT^^ 
3351 AGTCAOATTTATaaUSGCATTAAAinAAGWCAATTATtn^ 
3401 QQQAACCAAAOCiACTAAClVaMWSTAflTfcCCACTAACiy GA^ 

20 3451 TAOAACItXlCAGAAAACakOCXUWGUMRGXAAAA^ 

3501 rKTrArokcccfaai»3M» CT n^ 

3551 AOOCCJUlTOGyiCATATCMATTTATCMOAOCCATTTAA^ 
3601 CAOOAAAATATSaUlQAATGAAGKXrTaCCC^ 
3651 TTAACASAaOCASTACAAIUVAATAOaSUaXJ^^ 

25 3 701 AAASACTCCTAAATTTAW^TACCaiTACAAAWGAAACAtOCK^^ 

3751 GCnXiaAaUSaUSTATTtKKAAOOCACCTaGATTOCT^ 
3801 AATACCCCTCCCITAOTGAAGriTATCaQTACX»qTT»^ 
3851 AATAGGAOCAOAAACTTTCnrATGTAGATaaoaCAaCCAATftaOGftAACTA 
3901 AATTACK3]UUUiGCAOaATAT<nAJUnx;ACAGAG^^ 

30 3951 CCCXnPAAOCKJACAaACAAATCAaAAaACTaAaTTACAAO^^ 

4001 AGCTTTOCMaATTCOaGATTAaMMTIUUiaiTAOT^^ 
4051 ATOOLTTGGGAATCAlTaUlGCAaUiCCAG^^ 
4101 OTCAQTCAJUlTAATAGAQGAOTTAATAAAAAJiOGAAAAMII^^ 
4151 ATOGQTACX3^GCACACIUUlOG»ATTqGAOGAAAt WA CAAaTAgATGGGT 

35 4201 TGGTCAOTOCTCGJUiTCACWAAAOTACTAT^^ 

4251 GOXAAGIUWSJUlCATQACUUUiTATaiCA^ 
4301 TOArrTTAACCTACXiJ^CrGTAGTAOCAAAAGAIUlTAOTAaCC^ 
43S1 ATAAATOTCAGKnAAAAOaCX^AAGCCATaCAT^^
4401 CCACSGAATATGXKAOCTAGATTGTACACATrTAGAAaGaUUUUOT^

4451 GK3TAGCIUnTCATGTAGCC»(rrGGATATATAiGUUUSC^^ 

4501 CAOM^ACAOGKSCAAGAAACAGKSlTACTTCCTCn^ 

4551 TGQCCMTXMAkCA0TKCM7iCAaACK^^^ 

4601 TACAGTTAAGGCCXjKrCTGTIXXriXXSGOOGX^ 

4651 TTCCCTAaUlT(:X:CCAAAGTCAA0aACn»ATAC3AATX^^ 

4701 TTAAAGAAAATTATAGGAGAOGTAAGAGATCAGaCTGMACATCTT^^ 

4751 ASCAOTACAAATGGCAinATTCATCCACAATTIin^^ 

4801 TTG0aGGOTAC»(nX3CAGGaGAAAG»ATA6TAG»C»!n^ 

4851 ATACAAACTAAAGAATTACAAAAACaUkATTACAAiUUlT^^ 

4901 GGTTTATTAC3lGKK2AaUX3lGAaATC^^ 

4951 T<XrrCIGaAAA0(nt»AGG»aCA0TAGTAATAC2^^ 

5001 AAAGTAGTOCCAAiSAAaAAAAGCaUUWQATaiTaUXXM 

5051 GATG6CAGGTGAlX3ATTOTGTGOCJU^AGAC»aGATa»^ 

5101 TQGAAAAGATTAiGrrAAAAaiCCATATGTATATTTGIUyOa^^ 

5151 CTt3! Gm T A TAGACATCACl»TGAAAGTACTAATC C JtfU U MgAitflI^^ 

5201 AAiOTACACATCCCACTA»3GGATGCIAAATTAffrAATAAC^ 

5251 aOTCTaCATACAGKSAGAAAOAGACTGGCATTTCKaai^^ 

5301 AGAATGGAOGJUUUUkOAaATATAGCSlCACMUSTM^ 

5351 ACCAACTAATTCATCTOCACTATTTIGATn^^ 

S401 hOMXTACCKTMThaakCmhThaTeUf^^ 

5451 AGCU^<»TAACAAa6TAaaATCTCrAClUn-ACIT(^^ 

5501 TAAAACC»AAACA(lATAAAOCCACCTT7<3K»TAOTOTTA^^ 

5551 GAOGkCl^TQQPACMOCCaMSAkB^^ 

5601 TAOUlTGAATOGACACTAfSAOCTTTTAaAG 

5651 TASACATTTTCXrTAlCXUlTATGGCTCCATAACTIAia^^ 

5701 AAACTTAOGGOGATACTTGOQCAaaAGTGaAAGCCaiTIUaAIUS^ 

5751 CAACAAClX iC TOrrr A TCCATTTCMMlT^^ 

5801 AOGC(rTThCTC<SA!CM3MaeSh^^ 

5851 CTAGIiaCCXrrOGliMCKrCCtiQ^^ 

5901 TiXIClATTVlTAAAAAilTOTlXjCTrL'CATlXICCAAUl 1 IGl'TICATQACAA 

5951 AAOCCTTAiaqCATCTCCTATGOCAGqAAGAAQOGQA^^ 

6001 CKTlXaiTCAOJUlCAOTCAQACTOiTCAAQCITCf^ 

6051 AOTACATGIIUITGCAACCTATAATAOTAGCAATI^AG^ 

6101 OUlTAATAATAOauiTAGTTCrraxaGTCCATAaT^ 

6151 AAAATATTAAaACAAAaAAAAATA0ACAG0TTAATT6ATAaACTAKTA<^ 

6201 AAOAGCAaAAOAC3iari<3GK:AA1X»WGUW9TGUU^^ 

6251 TOGU^TGOCKKrrCKSAAATGaOCCACCATGCTCCTTGO
6301 CTCriAOK^n'ACAaJU^AAATTGTGGGTCACAGTCTATTXTX^^
63S1 TGTGGAAGGAAGCAACX^CCACTCrATTTTCrKKIAT^^ 
6401 TATOATACAGAOOTACATJUVWmTrCXWCCACACXTGKX^^ 
6451 AaACCCCAACK«ACAAGAAOTAGTATTaGTAAATOTOACA<»AAATT^ 
5 6501 ACATSTGGAAAAATGACATCX»AGAACA£»TGCMnMaATATAA^ 

6551 TrATCX»ATCAAAGCCTAAAGCCA«nOT»AAATrAA«XX^ 
6601 TAGTTTAAA(mK»CTGATTTGAAaAAXCATACTAATACC»ATA01»^ 
6651 GCXSGGAGAATCATAATGKJAGAAAGGAGAGATAAAAAACTtXnCTTTC^ 
6701 ATC3VGCAC3WX31TAAGAOATAAOOT(X AG AA<W^ 
10 6751 TAAACTTOATATAOTACCAATAGATAAXACCaOCWTAOaTTGATAAGTT 

6801 OTAACACCTCAGTCATTACACAGKXrCTOTCCAAAOCTATCXn^ 
6851 ATTCCCATACATTATTGTOCCCOOOCTWnTITOOaATT^^ 
6901 TAATAAGAOGriTCAATOGAACAGX»(XATGTACAAATOTC^^ 
6551 AAlOTACACATGCaUlTCAOOCaVGraWSTATCJ^^ 
15 7001 GGCAGTCT1tfKaiSAAGAAGAT0TMT3iAT^^ 

7051 CAATGCTAAAACCATMlTAGTACAOCrGaAC^ 

7101 GTACAAQACCCAAaUlCAATACAAaiUUUl^GTATa«TATOCAQMK^^ 

7151 CCAGK»SIU3JUX3lTTTOTTAGAATAaa^^ 

7201 ACATTOTAACaiTTAOTAaAOCAAAATOaAATCCCAC^^ 

20 7251 CTAOCAAATTAAfiAQJUiCAATTTOaAAATAATAAAACAA T fcATCT^ 

7301 aUlTCCTCAOGAGOGnACOCASIVAATTGTAAaiClkC^^ 
7351 AGKXKSU AAUTAUXj l A CTCTAATTCAAaiCa U i CW^ ^ 
7401 TTAATAGTACTTGGASTACTGAAlCXiGrrCAAATAACACTaA^ 
7451 ACAATCACACTCXamKMAAaaUUUiaUVTra^ 

25 7501 AGTAGQAAAAOCAATOTATGCCCCTCCCATCASXOC^CAAATTA^^ 

7551 OlTCaUATATTAClXXXXnXKrrATTAACAAGIV^ 
7601 AATOOOTCXXJAaATCTrCAGACCTGGaiGX^^ 

7651 GAOAAOTGAATTATATAAXTATAAAOTAGTAAAAATTOMOCATTMIOAG 
7701 TAGCACXX3^CCAAQQCAAAGAGAAGAOTGGTGCgU» fl A(»AAAAAaAQ^ 
30 7751 <H«G6AATA0QA0 CTAnVl " I XXTrGQ0TTCTT 0 qGACW M C^ i aAAgCAC 

7801 TAT(XKKn<lCACGTCAATaA(X3C«»CXXn^ACAa^^ 
7651 CTOATATAGlXKSVGCMXaUSAACMiTTTaCT^^ 
7901 C»aCKrCt<nTQChKCKACMTC^^ 

7951 AATCCTaaCrGT«!AAAGATACX:TAAAaGATCAM»OCTC^^ 
35 8001 aoaGTTGCTCTGGAAAACTCATTTWaiCCAC^^ 

8051 AGTTaaAOTAATAAATCTCTGGAACAaATTTGGJUlXAACATt^ 

8101 GaAOTGGaAaUSAGAAATTAACAATTACACAAOCrT^ 

8151 TTGAAGAATCXSCAAAACCaGGAAGAAAAOAATQAACAAaAATTATT^^
8201 TTAGATAAATGGGCAAGTTTGTGGJUITTOGTTTAAC^TAAC^
8251 OTGGTATATiUUU^TTATTCATAATOATJWSTASaM3(X7r^^ 
8301 G»UlTAaTTTTT(KrnnACTTTCTATAGTGAAT^^ 
8351 TCACCATTATOGTTTCAGACCCACCTCCCAATCC^^ 
5 8401 GCCCGAAOaAATAGAAGJU^GAAOOTQOAaAGAOMSAaU^^ 

8451 TTCaATTAGrrGAAOXUTCrTTAGCACn^ 
8501 CTGTGCCTCnCACXrrACCACOCSCRXSASAGUlCTTACTCTTU 
8551 GACKATXtnCGAACTTCTOGGACOOU^^ 
8^01 qOTOQAATCrrCCTACAOTATTQGAOK^^ 
10 8651 AACTTGKn-CAATGKXACACKXATAOCACrrAOC^^ 

8701 TATASAAOTATTAOUVSCAGCTTATAGAGCTATTCGCC^ 
8751 GAATAAGACAOOCKniXSaAAAGOATTTTGCIATAAai^^ 
8801 OTCAJUUUVGTAGTGTGATTGGATCCKXTTGXT^ 
6851 OAOCTGAOaZACKSVGaWQATGGOGTGC^^ 
15 8901 AAACATaCUiOCAATCACAAOTAGCAATACAGCAGtCT^^ 

8951 TGK:CTGCKn!AGAA<XACaUVG»OG»OaAAGAa<m W 
9001 CTCAjGXnACCnTAAOACCAATGACTTACAJ^^ 
9051 CACTTTTTAAAAGAAAA<3K3(XX>ailCICMAAOGHC^ 
9101 AAXaACAAQATATCCTTO A TCTCTqGATCtitf C a^^ 
20 9151 CiaATTGGC»aAACTACACACClVIGKXX3tfS0a(l^^ 

9201 TTTGGATG(nwrrAcauux:Titfna^ocjis^^ 

9251 OOCClUlTAAAGGAGAaJUlCACCIUXniKnTACAO^^ 

9301 GAATGK»lX»<XXrraAOAQAGAACrnnT^^ 

9351 CTAOCATTTCATCACmaQCCOaAGAGCXKXATCOOG^^ 

25 9401 CTGCTGACATCGAGCyrtXrrACAACKaGACA U UVU^ 

9451 acna(3anX36CCTa<XXXX3aACIOGG»JU3^^ 
9501 ATATAAqCAOCTG C T lYriXK :CTOTACTa( »l ^n'CTei'q(^ 
9551 TCTGAOCCTGKXaxXiTCTCTOOCTAACT^^ 
9$01 eAATAAAGCrTGCCTTOAOT G ClTC A AOTACriWa^ ^ 

30 9651 TGACTCT«rrAACTAOAaATa»nX:AGACCCn^^ 

9701 TCTCTAGCai 

However, for the purposes of comparing the nucleotide sequences of non-pathogenic HIV-1 strains including the ability to hybridise to a reference strain, the present invention extends to a genomic nucleotide sequence from any pathogenic strain of HIV-1.

Reference to a biological source includes blood or blood-related products or components such as lymphocytes, plasma, tissue fluid and tissue extracts.

Accordingly, in a particularly preferred embodiment, there is provided a viral isolate which:
(i) carries a genome which is capable of hybridising under medium stringency conditions to SEQ ID NO: 1 or a complementary form thereof or an analogous sequence from another pathogenic strain of HIV-1; and
(ii) carries a deletion mutation in a region corresponding to the nef gene and/or in an LTR region.
Generally, such an HIV-1 isolate is non-pathogenic as hereinbefore defined.

In a related embodiment, there is provided an isolated virus which:
(i) has a genome which is capable of hybridising under medium stringency conditions to complementary nucleic acid from a pathogenic strain of HIV-1; and
(ii) carries one or more deletion mutations in a region of its genome corresponding to a region which contains nef coding sequences and LTR nucleotide sequences.

For the purposes of defining the level of stringency, reference can conveniently be made to Maniatis et al (1982) at pages 387-389 which is herein incorporated by reference where the washing steps disclosed are considered high stringency. A low stringency is defined herein as being in 1-3X SSC/0.1-0.5% w/v SDS at 37-50°C for 2-3 hours. Depending on the source and concentration of nucleic acid involved in the hybridisation, alternative conditions of stringency may be employed such as medium stringent conditions which are considered herein to be [illegible] SSC/0.2[illegible] for 2-3 hours or high stringent conditions considered herein [illegible] SDS at 60°C for 1-3 hours.

In a particularly preferred embodiment of the present invention, the non-pathogenic strain of HIV-1 carries a mutation in the nef gene and/or LTR region of the genome. In an even more preferred embodiment, the non-pathogenic strain of HIV-1 carries a mutation in the nef gene such that a Nef protein is not produced or a modified Nef protein is produced substantially not carrying amino acids 162 to 177 or a portion thereof from wild-type Nef.

A "mutation" is considered herein to include a single or multiple nucleotide substitution, deletion and/or addition. Most preferred mutations are single or multiple deletions of at least one, most preferably at least ten and even more preferably at least twenty contiguous nucleotides from a region corresponding to the nef gene and/or the LTR region. When the non-pathogenic virus carries a mutation in the LTR region, this generally occurs 5' of the Sp1 sites. A particularly preferred deletion is from a region within the nef gene corresponding to amino acids 162 to 177 of the Nef protein.
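
As a non-limiting sketch, the length thresholds for contiguous deletions described above (at least one, at least ten, at least twenty nucleotides) can be checked on a pairwise alignment of a reference sequence and an isolate sequence. The function names, the use of "-" as the gap character and the toy sequences are assumptions made for illustration; no particular alignment tool is implied.

```python
def deletion_runs(ref_aligned, query_aligned, gap="-"):
    """Return lengths of contiguous deletions in the query relative to the reference.

    Both inputs are rows of one pairwise alignment (equal length). A deletion
    column is one where the reference has a base and the query has a gap.
    """
    if len(ref_aligned) != len(query_aligned):
        raise ValueError("aligned sequences must have equal length")
    runs, current = [], 0
    for ref_base, query_base in zip(ref_aligned, query_aligned):
        if query_base == gap and ref_base != gap:
            current += 1          # extend the current deletion run
        else:
            if current:
                runs.append(current)
            current = 0
    if current:
        runs.append(current)
    return runs


def deletion_tier(ref_aligned, query_aligned):
    """Classify the longest contiguous deletion against the 1 / 10 / 20 thresholds."""
    longest = max(deletion_runs(ref_aligned, query_aligned), default=0)
    if longest >= 20:
        return "at least twenty"
    if longest >= 10:
        return "at least ten"
    if longest >= 1:
        return "at least one"
    return "none"


# Toy alignment (hypothetical sequences): eleven reference bases are absent.
reference = "ACGTACGTACGTACGT"
isolate = "ACG-----------GT"
print(deletion_runs(reference, isolate), deletion_tier(reference, isolate))
```

Only the longest run is classified, so two separate short deletions are not added together; summing them would be a different design choice.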

According to a preferred aspect of the present invention, there is provided a viral isolate which:
(i) is reactive to antibodies to a glycoprotein from HIV-1 such as at least one of gp41-45, gp120 and/or gp160;
(ii) is substantially non-pathogenic in human subjects; and
(iii) carries a deletion mutation of at least ten nucleotides in a region corresponding to the nef gene and/or LTR region of a pathogenic strain of HIV-1.

In another embodiment, there is provided a viral isolate which:
(i) is capable of inducing an immune response to at least one of [illegible];
(ii) is substantially non-pathogenic in human subjects; and
(iii) carries a deletion mutation of at least ten nucleotides in a region corresponding to the nef gene and/or LTR region of a pathogenic strain of HIV-1.

Preferably, in respect of the latter embodiment, the immune response is an antibody or a cell mediated response. In a most preferred embodiment, the immune response is a humoral immune response.

In another aspect of the present invention, there is provided a viral isolate which:
(i) is interactive to antibodies to a glycoprotein from HIV-1 such as at least one of gp41-45, gp120 and/or gp160;
(ii) is substantially non-pathogenic in human subjects; and
(iii) carries a deletion mutation of at least ten nucleotides in a region corresponding to all or part of amino acids 162 to 177 encoded by the nef gene of a pathogenic strain of HIV-1.

The nucleotide sequence of the nef gene in HIV-1 NL4-3 is defined in SEQ ID NO: 650:
ATGG6T00CAAGTG0TCIUUUUU7rA<IT<yrGXTXtXaAT^ 

15 QliCTrXCMaaCAGCTOTI^TCTT^^ 

6GCTAA.TTCACTCa:JUUtf3AAa2iaUUakTATCCT^^ 
aGCTACTTCCCTGATTKXSCACSAACTACA^^ 
CTTTOGATaaTOCTACIUUXn'AGnUXaWC^^ 
AAOaAaAOAACACaU^CTTGITI^^ 
20 AOACUUl6TOTTAGA(n«3AGG»rTTaACIVGCCOCCT 
GCTOCATCCXXSAGTACTTCAAGAACTOCrrGA 

The nucleotide sequence encoding amino acids 162-177 of wild-type HIV-1 NL4-3 Nef is as follows:

25 ACCMXnTGTITACACXCTGTCAaCCTOCAT^^ [SSQ ID 

MO: 802] . 
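
For illustration only, the relationship between a coding nucleotide sequence and the amino acids it encodes can be sketched with the standard genetic code. The 30-nucleotide string below is an assumed reading of the legible opening of the sequence above in this copy of the specification; it is not a verified transcription of SEQ ID NO: 802, and the function names are hypothetical.

```python
from itertools import product

# Standard genetic code, with bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys", "Q": "Gln",
    "E": "Glu", "G": "Gly", "H": "His", "I": "Ile", "L": "Leu", "K": "Lys",
    "M": "Met", "F": "Phe", "P": "Pro", "S": "Ser", "T": "Thr", "W": "Trp",
    "Y": "Tyr", "V": "Val", "*": "Ter",
}


def translate(dna):
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def to_three_letter(protein):
    """Render one-letter amino acid codes in three-letter form."""
    return " ".join(THREE_LETTER[residue] for residue in protein)


# ASSUMPTION: reconstructed reading of the first ten codons only.
ASSUMED_FRAGMENT = "ACCAGCTTGTTACACCCTGTGAGCCTGCAT"
print(to_three_letter(translate(ASSUMED_FRAGMENT)))
```

Under that assumed reading, the ten codons translate to residues that agree with the opening of the amino acid 162 to 177 sequence recited earlier, which serves as a consistency check between the two listings.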

The present invention extends to any or all single or multiple nucleotide deletions to a contiguous series of at least ten nucleotides from the nef gene which render the strain avirulent. The deletions may encompass the entire gene or parts thereof and may represent a single deletion or two or more deletions. Put in alternative terms, the non-pathogenic HIV-1 isolates of the present invention comprise a nucleotide sequence at the corresponding nef gene region non-identifiable to SEQ ID NO: 650, said non-identity comprising at least 5%, more preferably at least 10% and even more preferably at least 20% variation thereon. The present invention particularly extends to any or all single or multiple nucleotide deletions to a contiguous series of at least ten nucleotides from the region of the nef gene corresponding to all or part of amino acids 162 to 177 and which render the strain avirulent. Although this aspect of the present invention is exemplified by specific reference to amino acids 162 to 177, the present invention also extends to a functionally analogous sequence where, for example, amino acids are substituted by structurally or functionally similar amino acids. In addition, the present invention extends to any deletion or other mutation in an HIV-1 derived protein which renders that strain non-pathogenic.
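
The notion of a "contiguous series of at least ten nucleotides" can be sketched, purely for illustration, as the set of overlapping ten-nucleotide windows (decanucleotides) of a reference sequence, together with a check of which windows fall wholly inside a given deletion. The function names and the 0-based, end-exclusive coordinates are assumptions of this sketch; the 15-nucleotide example string is an assumed short fragment used only to show the windowing.

```python
def contiguous_windows(sequence, k=10):
    """Return every overlapping k-nucleotide window, stepping one base at a time."""
    return [sequence[i:i + k] for i in range(len(sequence) - k + 1)]


def windows_within_deletion(sequence, del_start, del_end, k=10):
    """Return the k-nucleotide windows lying wholly inside a deleted interval.

    del_start and del_end are 0-based positions on the reference sequence,
    with del_end exclusive (an assumption of this sketch).
    """
    first = max(del_start, 0)
    last = min(del_end, len(sequence)) - k
    return [sequence[i:i + k] for i in range(first, last + 1)]


# ASSUMPTION: short illustrative fragment, not a verified transcription.
fragment = "ATGGGTGGCAAGTGG"
for offset, window in enumerate(contiguous_windows(fragment)):
    print(offset, window)

# A deletion removing positions 2..12 (eleven bases) spans two whole windows.
print(windows_within_deletion(fragment, 2, 13))
```

A sequence of length n yields n - 9 decanucleotides, which is why a complete listing for a gene of several hundred nucleotides runs to several hundred entries.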



In a preferred embodiment, therefore, the present invention contemplates a viral isolate which:
(i) is reactive to antibodies to a glycoprotein from HIV-1 such as at least one of gp41-45, gp120 and/or gp160;
(ii) carries a genome or a part or fragment thereof capable of hybridising under medium stringency conditions to a nucleotide sequence as set forth in SEQ ID NO: 1 or a complementary form thereof or an analogous sequence from another pathogenic strain of HIV-1;
(iii) carries a deletion of at least ten nucleotides in a region corresponding to the nef gene in HIV-1 NL4-3; and
wherein said deletion encompasses one or more of the following decanucleotides from the nef gene of HIV-1 NL4-3 or corresponding sequences from another pathogenic strain of HIV-1:

ATGGGTGGCA (SEQ ID NO: 2); TGGGTGGCAA (SEQ ID NO: 3);
GGGTGGCAAG (SEQ ID NO: 4); GGTGGCAAGT (SEQ ID NO: 5);
GTGGCAAGTG (SEQ ID NO: 6); TGGCAAGTGG (SEQ ID NO: 7);
GGCAAGTGGT (SEQ ID NO: 8); GCAAGTGGTC (SEQ ID NO: 9);
CAAGTGGTCA (SEQ ID NO: 10); AAGTGGTCAA (SEQ ID NO: 11);
AGTGGTCAAA (SEQ ID NO: 12); GTGGTCAAAA (SEQ ID NO: 13);
TGGTCAAAAA (SEQ ID NO: 14); GGTCAAAAAG (SEQ ID NO: 15);
GTCAAAAAGT (SEQ ID NO: 16); TCAAAAAGTA (SEQ ID NO: 17);
CAAAAAGTAG (SEQ ID NO: 18); AAAAAGTAGT (SEQ ID NO: 19);
AAAAGTAGTG (SEQ ID NO: 20); AAAGTAGTGT (SEQ ID NO: 21);
AAGTAGTGTG (SEQ ID NO: 22); AGTAGTGTGA (SEQ ID NO: 23);
GTAGTGTGAT (SEQ ID NO: 24); TAGTGTGATT (SEQ ID NO: 25);
AGTGTGATTG (SEQ ID NO: 26); GTGTGATTGG (SEQ ID NO: 27);
TGTGATTGGA (SEQ ID NO: 28); GTGATTGGAT (SEQ ID NO: 29);
TGATTGGATG (SEQ ID NO: 30); GATTGGATGG (SEQ ID NO: 31);
ATTGGATGGC (SEQ ID NO: 32); TTGGATGGCC (SEQ ID NO: 33);
TGGATGGCCT (SEQ ID NO: 34); GGATGGCCTG (SEQ ID NO: 35);
GATGGCCTGC (SEQ ID NO: 36); ATGGCCTGCT (SEQ ID NO: 37);
TGGCCTGCTG (SEQ ID NO: 38); GGCCTGCTGT (SEQ ID NO: 39);
GCCTGCTGTA (SEQ ID NO: 40); CCTGCTGTAA (SEQ ID NO: 41);
CTGCTGTAAG (SEQ ID NO: 42); TGCTGTAAGG (SEQ ID NO: 43);
GCTGTAAGGG (SEQ ID NO: 44); CTGTAAGGGA (SEQ ID NO: 45);
TGTAAGGGAA (SEQ ID NO: 46); GTAAGGGAAA (SEQ ID NO: 47);
TAAGGGAAAG (SEQ ID NO: 48); AAGGGAAAGA (SEQ ID NO: 49);
AGGGAAAGAA (SEQ ID NO: 50); GGGAAAGAAT (SEQ ID NO: 51);
GGAAAGAATG (SEQ ID NO: 52); GAAAGAATGA (SEQ ID NO: 53);
AAAGAATGAG (SEQ ID NO: 54); AAGAATGAGA (SEQ ID NO: 55);
AGAATGAGAC (SEQ ID NO: 56); GAATGAGACG (SEQ ID NO: 57);
AATGAGACGA (SEQ ID NO: 58); ATGAGACGAG (SEQ ID NO: 59);
TGAGACGAGC (SEQ ID NO: 60); GAGACGAGCT (SEQ ID NO: 61);
AGACGAGCTG (SEQ ID NO: 62); GACGAGCTGA (SEQ ID NO: 63);
ACGAGCTGAG (SEQ ID NO: 64); CGAGCTGAGC (SEQ ID NO: 65);
GAGCTGAGCC (SEQ ID NO: 66); AGCTGAGCCA (SEQ ID NO: 67);
GCTGAGCCAG (SEQ ID NO: 68); CTGAGCCAGC (SEQ ID NO: 69);
TGAGCCAGCA (SEQ ID NO: 70); GAGCCAGCAG (SEQ ID NO: 71);
AGCCAGCAGC (SEQ ID NO: 72); GCCAGCAGCA (SEQ ID NO: 73);
CCAGCAGCAG (SEQ ID NO: 74); CAGCAGCAGA (SEQ ID NO: 75);
AGCAGCAGAT (SEQ ID NO: 76); GCAGCAGATG (SEQ ID NO: 77);
CAGCAGATGG (SEQ ID NO: 78); AGCAGATGGG (SEQ ID NO: 79);
GCAGATGGGG (SEQ ID NO: 80); CAGATGGGGT (SEQ ID NO: 81);
AGATGGGGTG (SEQ ID NO: 82); GATGGGGTGG (SEQ ID NO: 83);
ATGGGGTGGG (SEQ ID NO: 84); TGGGGTGGGA (SEQ ID NO: 85);
GGGGTGGGAG (SEQ ID NO: 86); GGGTGGGAGC (SEQ ID NO: 87);
GGTGGGAGCA (SEQ ID NO: 88); GTGGGAGCAG (SEQ ID NO: 89);
TGGGAGCAGT (SEQ ID NO: 90); GGGAGCAGTA (SEQ ID NO: 91);
GGAGCAGTAT (SEQ ID NO: 92); GAGCAGTATC (SEQ ID NO: 93);
AGCAGTATCT (SEQ ID NO: 94); GCAGTATCTC (SEQ ID NO: 95);
CAGTATCTCG (SEQ ID NO: 96); AGTATCTCGA (SEQ ID NO: 97);
GTATCTCGAG (SEQ ID NO: 98); TATCTCGAGA (SEQ ID NO: 99);



ATCTCGAGAC (SEQ ID NO: 100); TCTCGAGACC (SEQ ID NO: 101);
CTCGAGACCT (SEQ ID NO: 102); TCGAGACCTA (SEQ ID NO: 103);
CGAGACCTAG (SEQ ID NO: 104); GAGACCTAGA (SEQ ID NO: 105);
AGACCTAGAA (SEQ ID NO: 106); GACCTAGAAA (SEQ ID NO: 107);
ACCTAGAAAA (SEQ ID NO: 108); CCTAGAAAAA (SEQ ID NO: 109);
CTAGAAAAAC (SEQ ID NO: 110); TAGAAAAACA (SEQ ID NO: 111);
AGAAAAACAT (SEQ ID NO: 112); GAAAAACATG (SEQ ID NO: 113);
AAAAACATGG (SEQ ID NO: 114); AAAACATGGA (SEQ ID NO: 115);
AAACATGGAG (SEQ ID NO: 116); AACATGGAGC (SEQ ID NO: 117);
ACATGGAGCA (SEQ ID NO: 118); CATGGAGCAA (SEQ ID NO: 119);
ATGGAGCAAT (SEQ ID NO: 120); TGGAGCAATC (SEQ ID NO: 121);
GGAGCAATCA (SEQ ID NO: 122); GAGCAATCAC (SEQ ID NO: 123);
AGCAATCACA (SEQ ID NO: 124); GCAATCACAA (SEQ ID NO: 125);
CAATCACAAG (SEQ ID NO: 126); AATCACAAGT (SEQ ID NO: 127);
ATCACAAGTA (SEQ ID NO: 128); TCACAAGTAG (SEQ ID NO: 129);
CACAAGTAGC (SEQ ID NO: 130); ACAAGTAGCA (SEQ ID NO: 131);
CAAGTAGCAA (SEQ ID NO: 132); AAGTAGCAAT (SEQ ID NO: 133);
AGTAGCAATA (SEQ ID NO: 134); GTAGCAATAC (SEQ ID NO: 135);
TAGCAATACA (SEQ ID NO: 136); AGCAATACAG (SEQ ID NO: 137);
GCAATACAGC (SEQ ID NO: 138); CAATACAGCA (SEQ ID NO: 139);
AATACAGCAG (SEQ ID NO: 140); ATACAGCAGC (SEQ ID NO: 141);
TACAGCAGCT (SEQ ID NO: 142); ACAGCAGCTA (SEQ ID NO: 143);
CAGCAGCTAA (SEQ ID NO: 144); AGCAGCTAAC (SEQ ID NO: 145);
GCAGCTAACA (SEQ ID NO: 146); CAGCTAACAA (SEQ ID NO: 147);
AGCTAACAAT (SEQ ID NO: 148); GCTAACAATG (SEQ ID NO: 149);
CTAACAATGC (SEQ ID NO: 150); TAACAATGCT (SEQ ID NO: 151);
AACAATGCTG (SEQ ID NO: 152); ACAATGCTGC (SEQ ID NO: 153);
CAATGCTGCT (SEQ ID NO: 154); AATGCTGCTT (SEQ ID NO: 155);
ATGCTGCTTG (SEQ ID NO: 156); TGCTGCTTGT (SEQ ID NO: 157);
GCTGCTTGTG (SEQ ID NO: 158); CTGCTTGTGC (SEQ ID NO: 159);
TGCTTGTGCC (SEQ ID NO: 160); GCTTGTGCCT (SEQ ID NO: 161);
CTTGTGCCTG (SEQ ID NO: 162); TTGTGCCTGG (SEQ ID NO: 163);
TGTGCCTGGC (SEQ ID NO: 164); GTGCCTGGCT (SEQ ID NO: 165);
TGCCTGGCTA (SEQ ID NO: 166); GCCTGGCTAG (SEQ ID NO: 167);
CCTGGCTAGA (SEQ ID NO: 168); CTGGCTAGAA (SEQ ID NO: 169);
TGGCTAGAAG (SEQ ID NO: 170); GGCTAGAAGC (SEQ ID NO: 171);
GCTAGAAGCA (SEQ ID NO: 172); CTAGAAGCAC (SEQ ID NO: 173);
TAGAAGCACA (SEQ ID NO: 174); AGAAGCACAA (SEQ ID NO: 175);



GAAGOICAAG (SEQ 10 MO: 
A<SCACAA6AG(SEQ ID NO: 
CA.CM^GA66A(SEQ 10 NO: 
CAAaAGGAGQ(SEQ ID NO: 
5 AaAOGAGGAA(SEQ ID NO: 
A.GGAOaAAGA.(SEQ ID NO: 
GAOGAASMCKSEQ ZD NO: 
GGAAGAGGT6(SEQ ID NO: 
MOUGGTGQQiSEQ ID NO: 
10 GACGTGGGTT (SEQ ID MO: 
GGTGOGrrTTCSEQ ID NOt 
TGGGTTTTCCCSBO ID NO: 
OGTTrrCCA6(SEQ ID NO) 
TTTTCCAOTC(SEQ ID NO: 
15 TTCCAGTCAC(SEQ ID MO: 
CX»aTCAaU2(SEQ ID NO: 
AOTCACACCTCSEQ ID NO: 
TCAaiCCTCA(SEQ ID NO: 
ACACCTCAjC36(SEQ ID NO: 
20 ACCTCA0OTA(S£Q ID NO: 
CTCAjOOTACCCSBQ ID NO: 
CAGGTACCTT(SBO ID NO: 
GGTACCTTTACSEQ ID NO: 
TACCTTTAA3(5EQ ID NO: 
25 CCTTTAAGAC(SEQ ID NO: 
TTTAAOACCA(SEQ ID NO: 
TAAOACCAAT(SB0 ID NO: 
AGACCAATGA(SEQ ID NO: 
ACCAATGACT(SBQ ID NO: 
30 CAATGACTTA(SEQ ID NO: 
ATOACTTACACSBQ ID NOt 
aACrrACAAO(SEQ ZD NO: 
CTTACAAaaC(SEQ ID NO: 
TACAAOGCAGCSEQ ID NOi 
35 CAA06CA0CT(SEQ ID NO: 
A0GCA6CTGT(SEQ ID NO: 
OCAOCT6TAG(SEQ ID NO: 
AOCTOTAaAT<SEQ ID NO: 
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176); AAGCACAAOAiSBQ ID NO: 177) 
178); GCACAAGAOO(SSQ ID NO: 17») 
180); ACAASAQGAG(SBQ ID NO: 181) 
182); AAGAaC3AGGA(SEQ ID NO: 183), 
184); GAGGAOGiAAGCSSQ ID NO: 185); 

186); GGAOGAAOACrSBQ ID MO: 187); 

188); AGGAAOAGGTCSEQ ID NO: 189); 

190) ; GAAaAGGTOG(SBQ ZD NO: 191) ; 

192); AGA6GrraGGT{SBQ ID HO: 193); 

194) ; AGG1XXa6TTT(SBQ ZD HO: 195); 

195) ; GTOaOTTTTC(SEQ ZD NO: 197); 
198) ; 0GOTTTTCCA(SBQ ID NOt 199) ; 
200) ; GTTTTCXAGT<SBQ ZD NO: 201) ; 
202); TTTCCAOTqA{SBQ ID MOt 203); 
204) } TCCAOTCACACSBQ ID MO: 205) ; 
20£); CAOTCACaCC (SEQ ID MO: 207); 
208); GTCACACCTC(8BQ ID MO: 209); 
210); CACACCTCAO(SBQ ID MD: 211); 
212); CACXrrCAOGTOBQ ZD HO: 213); 
214); CCTCIWaGTAC(SBQ ID NOt 215} ; 
21fi) ; TCAGGrACCT(SBQ ID MD: 217) ; 
218) ; AGaTACCrrT(SBQ ZD MOi 219) ; 
220); GTACXrrTTAA(8BQ ZD MD: 221); 
222} ; ACCTTTAAOA(SBQ ID MD: 223) ; 
224); CTTTAA6ACC(8aQ ID NO: 22S) ; 
226) I TTAAaACCAA(SBQ ID NO: 227) ; 
228); AAOACCAATOCSBQ ID MD: 229); 
230); GACCAAT8AC(SBQ ID MO: 231); 
232); CCAATGACTT{SBQ ZD MO: 233); 
234); AATOACTTACCSBQ ID MOt 23S) ; 
236) ; TaACTTACAA(SBQ ZD NO: 237) ; 
238) ; ACTTACAAaO(8BQ ID MDt 239) ; 
240) ; TTACAAOOCACSBQ ID MDt 241) ; 
242) ; ACAAOOCAGC(SaQ ZD MOt 243) ; 
244) ; AAGaCA0CTO(8SQ ZD MD: 245) ; 
246) ; GaCAGCTGTA(SBQ ZD MO: 247) ; 
248) ; CAaCTGTAaA(8BQ ID MOt 249) ; 
250) ; GCTGTAGATC(8BQ ZD NO: 251) ; 
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CTGTAGATCTCSEQ ID NO: 252); TGfTASATCXT(SBQ ID KO: 253). 
GTAGATCTTA(SEQ ID NOt 254); TAGATCTTAO(SBQ ID MO: 255); 
AGATCTTAGCfSBQ ID MO: 2S€) ; OATCTTAOCC (SBQ ID MO: 257); 
ATCTTAGCCArSEQ ID NO: 258); TCTTAOCCAC (SBQ ID NO: 259); 
5 CTTAGCCACT(SEQ ID NO: 260); TTAOCCACTT (SBQ ID NO: 261); 
TAGCCACTTT(SBQ ID NO; 2S2) ; AOCCACTTTT (SBQ ID MO: 263); 
GCCACTTTTT(SEQ ID NO: 2«4)/ CCACTTTTTA (SBQ ID MO: 2€5) ; 
CACTTTTTAA(SEQ ID KO: 266); ACTTTTTAAA (SBQ ID MO: 267); 
CTTTTTAAAA(SEQ ID NO; 268); TTTTTAAAAO (SBQ ID MO: 269); 
10 TTTTAAAAaA(SEQ ID NO; 270); TTTAAAAGAA (SBQ ID MO: 271),' 
TTAAAAaAAA(SBQ ID NO: 272); TAAAAGAAAA(SBQ ID MO: 273); 
AA»AGAAAAG(SEQ ID NO: 274); AAAOAAAAOO (SBQ ID MO: 275); 
AAaAAAAGGO(SEQ ID NO: 276); AGAAAAGCX3G (SBQ ID MO: 277); 
OAAAAGGGGC(SEQ ID NO: 278); AAAAOOGGOG (SBQ ID MO: 279); 
15 AAAOGGOGGA(SEQ ID NO: 280) ; AAOaGGGGAC (SBQ ID MO: 281) ; 
AGGGOGGACr(SEQ ID NO: 282); GaQG0GACnS(8BQ ID MD: 283); 
GOGWaACTaG(SBQ ID NO: 284); GOaQACrOGA(SBQ ID MO: 205); 
00<»CTGGAA(SEQ ID NO: 286), OGACTOOAAO (SBQ ID MO: 287); 
GACTOGAAGO(SEQ ID NO: 288); ACTGOAAOOG (SBQ ID MO: 289); 
20 ClWSAAflGOC(SEQ ID MO: 290) , TOOAAOOGCTtSBQ ID MDi 291) ; 
«3AAOGOCTA(SBQ ID NO: 292); aAAa00CTAA(8BQ ID MO: 293)1 
AAOGOCTAAT(SEQ ID NO: 294); AOOOCTAATT (SBQ ID MOt 295); 
OGOCTAATTC(SBQ ID MO: 296); GOCTAATXCA (SBQ ID MO: 297); 
OCTaATTCAC(SBQ ID NO: 298); OWATTCACT (SBQ ID MO: 299); 
15 TAATTCACTC(SBQ ID NO: 300); AATTCACTCC (SBQ ID MOi 301), 
ATTCACrCCC(SEQ ID NO: 302); TTCACT0CC3^ (SBQ ID MO: 303); 
TCACTCCCAA(SBQ ID NO: 304), CACTCCCAAA (SSQ ID MO: 305); 
ACTCCCAAAG(SEQ ID NO: 306), CTCCCAAAOA (SBQ ID MOi 307)) 
TCCCAAAOAA(SEQ ID NO: 308), CCCAAAOAAO (SBQ ID MOi 309); 
(0 CCAAAaAAaA(SEQ ID NO: 310); CAAAOAAGAC (SBQ ID MO: 311), 
AAAaAA(3ACA(SEQ ID NO: 312), AAOAAGACAA (SBQ ID MO: 313),' 
AaAAOACAAO(SBQ ID NO: 314), OAAOACftAOA (SBQ ID MOt 315)1 
A*OACAAGAT(SBQ ID NO: 316), AOACAAOATA (SBQ ID BO: 317), 
GACAAGATAT(SBQ 10 NO: 318), ACAAOATATC (SSQ ID MO: 319), 
S GAAaATATCC(SBQ ID NO: 320), AAOATATCCT (SBQ ID MO: 321), 
AaATATCXrrT(SEQ ID MO: 322), OATATCCTTO(S8Q ID MO: 323)) 
ATATCCTTOA(SEQ ID MO: 324), TATCCTTCAT(SBQ ID MO: 325); 
ATCCTTaATC(SEQ ID NO: 326); TCCTTCATCT (SBQ ID MO: 327); 



CCTTGATCTG(SEQ ID NO: 
TTGATCTOTG (SEQ ID NO: 
<3ATCTGTCGA.($EQ ID NO: 
TCTGTGGATC(SEQ ID NO: 
5 TGTGGATCTA{SEQ ID NOj 
TGGATCTACC(SEO ID NO; 
GATCTACCAC(SEQ ID NO: 
TCTACCACAC (SEQ ID NO: 
TACCACACAC(SEQ ID NO; 
10 CCACACACAA(S8Q ID NO: 
ACACACAAOGCSEQ ID NO: 
ACACAAGGCT(SEQ ID NO: 
ACAAGGCTAC(SEQ ID NO: 
AAGGCTACTT(SEQ ID NO: 
15 GOCTACTTCC(SEQ ID NO: 
CTACTTCCCT(SEQ ID NO: 
ACTTCCCraA(SEQ ID NO: 
TTCCCTOATTCSEQ ID NOs 
CXXTGArrGO(S£Q ID NO: 
20 CTQATTG6CA(SEQ ID NO: 
CATTO(KaUSA(SBQ ID KO: 
rrGGCAGAACCSEQ ID NO: 
O0CftGAACTA(SEQ ID NO: 
CiVGAACTACACSBQ ID NO: 
25 OAACTACACACSBQ ID NO: 
ACTACACACX:(SEQ ID NO: 
TACACACCASCSEQ ID NO: 
CACACCAGGO{SEQ ID NO: 
CACCAQGGCC (SBQ ID NO: 
30 CCAGOaCCAG(SEQ ID NO: 
AOOGCCAQOCKSEQ ID NO: 
QGCCAaGG6T(SBQ ID NO: 
CCAOOGOTCACSSQ ID NO: 
AGGGOTCAGA(SBQ ID NO: 
35 GOGTCAGATACSEQ ID NO: 
aTCAGATATC(SBQ ID NO: 
CAOATATCCA (SBQ ID NO: 
C3ATATCCACT(SEQ ID NO: 
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328); CTTQATCTGT (SEQ ID NO: 329) 
330); TGATCrGTGG(SEQ ID NO: 331) 
332}; ATCTOTOGAT (SBQ ID NO: 333) 

334) ; CTGrr6GATCT(SBQ ID NO: 335) 

335) ; GTGGATCTAC(SBQ ID NO: 337) 
338) ; GGATCTACCA(SBQ ID NO: 339) , 
340); ATCTACX»CA(SBQ ID NO: 341) j 
342); CTACCACACA(SEQ ID NO: 343); 
344); ACCACACACA(SBQ ID NO: 345); 
34C); CACACACAA8 (SBQ ID MO: 347); 
348); CACACAA0GC(SBQ ID NO: 349); 
350) ; CACAA9aCTA(SBQ ID NOt 351) ; 
352) ; CAAOGCTACT(SBQ ID NO: 353) ; 
354); AaaCTAiCTTC(SBQ ID NO: 355); 
356); GCTACTTCCC(SBQ ID MO: 357); 
358); TACTTCCCTS<SBQ ID MOs 359); 
360) ; CTTCCCTaAT(SBQ ID MOt 361) ; 
362) ; TCGCTaATT6(SBQ ID MO: 363) ; 
364); CCTGATTOaC(5BQ ID MO: 365); 
366)/ TQA7TGtGGAO(SBQ lO MOt 367); 
36$) ; ATTO(iCAGAA(SaQ ID MOt 369) ; 
370); TGOCAaAACT(SBQ ID MO; 371); 
372) ; GCAGAACTACCSBQ ID MO: 373) ; 
374); AaAACTAGAC(SBQ ID NOt 375); 
376) ; AACTACACAC(8BQ ID NO: 377} ; 
378) I CTACACACCA(SBQ ID NOt 379} ; 
380); ACACACCAOOCSBQ ID MOt 381); 
382); ACACCMaGGC(SBQ ID MOt 383); 
384); ACCAOOGCCACSBQ ID NO: 385); 
386); CAaaoCCAGG(SaQ ID MOt 387); 
388); OGGCCAaOaG(SaQ ZD NOt 389); 
390); OCCAOOGOTCOBQ ID NOt 391); 
392) I CAj6aaOTCA6(SBQ ZD NOt 393) ; 
394); OaaOTCAaAT(SBQ ID MOt 395); 
396); 06TCAaATAT(SBQ ID MOt 397); 
398); TCAQATATCC(SBQ ID MOt 399); 
400) ; A6ATATCCAC(8BQ ID NOt 401) ; 
402) I ATATCCACTO(SBQ ID NO: 403) ; 
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TATCCACTGA(SEQ ID NO: 404) ; ATCCACTGAC {SBQ ID MO: 405) ; 
TCCACTGACC(SSQ ID NO: 406); CCACTOACCT (SBQ ID MO: 407); 
CACTOACCrr (8EQ ID NO: 406); ACTQACCTTT (SSQ ID NO: 409); 
CTGACCTTTG (SEQ ID NO: 410); TGACCTTTGG (SBQ ID NO: 411); 
5 GACCTTTGGA(SEQ ID NO: 412); AOCrTKSGAT(SEQ ID NOj 413); 
CCTTTGGATG (SEQ ID NO: 414); CrrTOGATGO (SBQ ID NO; 415); 
TTTGGATGOT(SEQ ID NO: 416); TTGGATGGTG (SBQ ID NOi 417); 
TOGATGGTGC(SBQ ID MO: 41«) ; GGATGfGTGCT (SSQ ID NO: 419); 
GATOGTGCTA($EQ ID NO: 420); ATS6TGCTAC (SBQ ID NO: 421); 
10 TGGTGCTACA(SEQ ID NO: 422); G6TGCTACAA (SBQ ID NO: 423); 
(^TCCTACAAG (SEQ ID NO: 424); TOCTACAAOC (SBQ ID NO: 425); 
GCTACAAOCT(SEQ ID NO: 426) ; C7ACAAGCTA(SBQ ZD NOt 427) ; 
TACAAGCTAG(SEQ ID NO: 428); ACAAGCTAGT (SEQ ID NO: 429); 
CAAaCTAGTA(SEQ ID NO: 430); AA(3CTAGTAC (SBQ 10 MO: 431); 
IS AGCTACTACC(SEQ ID NO: 432); OCTAOTACCA(SBQ ID MO: 433); 
CTA<n'ACCAO(SBQ ID NO: 434); TA6TACCA0T(SBQ ZD MO: 435); 
AaTACCAGTT(SEQ ID NO: 436); OTACOUSTTG (SBQ ID MOt 437); 
TACCAGTTGA(S^ ID NO: 438); ACCAGrTTOAS (SBQ ID MO: 439); 
CCAaTTQAGC(SEQ ID NO: 440); OiffnGMCC iSBQ ZD MOt 441); 
20 AGTTGAGOCA(S6Q ID NO: 442); OTTOAOCCAG (SBQ ZD NOt 443); 
TTGAGCCAaA(SEQ ID NO: 444); TOAOOCACSAT (SBQ ID MOt 445); 
aAaCCAGATA(SEQ ZD NO: 446) / AGCC»OATAA(SBQ ID MO: 447) ; 
GCCAGATAA6(SEQ ID NOt 448); CCAGATAAOO(SBQ ZD MOt 449); 
CAGATAAOOT(SeQ ID NO: 450); AGA.TAA06TA (SBQ ZD MOt 451); 
25 GATAAaGTA6(SEQ ID NO: 452); ATAAOGTAGA (SBQ ID NO: 453); 
TAA60TAaAA(SEQ ID NO: 454); AAOGTTAGAAG (SBQ ID MOt 455); 
AgaTAGAA(3A(SBQ ID MO: 456) | OGTAOikASAGfSBQ ZD MOt 457); 
GTAaAAGAG6(SBQ ZD NO: 458); TAOAAaAQQC (SBQ ID MO: 459); 
AG»A0AaOCC(SBQ ZD NO: 460); GtAQAaOCCXtSBQ ZD MO: 461); 
30 AASAG6CCAA(SEQ ZD NO: 462); AGAOGCCAAT (SBQ ID MO: 463); 
aAaaCCAATA(SEQ ID NO: 464) ; AgaCCXATAA(SBQ ZD MOt 465) ; 
aoCCAATAAA(SBQ ZD MO: 466); GCXSUiTAAAOOBQ ID MOt 467); 
CCAATAAAOG(SBQ ID NO: 468); CAAZAAAGGA (SBQ ID MOt 469); 
AATAAAOGAO (SEQ ZD NO; 470); ATAAAC3GA8A(SBQ ZD MO: 471); 
3S TAAAGGAGAOCSEQ ID MO: 472); AAAQOAQAGA (SBQ ZD MO: 473); 
AAGaAaAGAA(SBQ ID MO: 474); AGGAGA62UIC (SBQ ID MO: 475); 
GaAaAGAACA(SEQ ID NOt 476); Cau3AaAACAC(SEQ ZD NOt 477); 
AaAGAACACC(SEQ ZD NO: 478); aAaAACACCA(SBQ ID MOt 479); 



AGAACkCCAG (SEQ ID NO: 
AACACCAGCT(SEQ 10 NO: 
CACCAGCTTG(SEQ ID MO: 
CCAGCTTGTT(SEQ ID NO: 
5 AGCTTGTTAC (SEQ ID NO; 
CTTGTTACAC (SEQ ID NO: 
TGTTACACCC (SEQ ID NO: 
TTACACCCTG (5EQ ID NO: 
ACACCCTOTG(SEQ ID NO: 
10 ACCCTGTGSAG (SEQ ID NO: 
CCTGTGAaCC (SBQ ZD NO: 
TQTGAGCCTG (SEQ ID NO: 
TGAaCCTGCA(SEQ ID NO: 
A0CCTOCATG(SBQ ID NO: 
IS CCTGCA.TGaA(SBQ ID NO: 
TGCATGOAAT(SEQ ID NO: 
CATG6AAT0O(SEQ ID NO: 
TQaAATGGAT(SBQ ID NO: 
GAATaGATaA(S&Q ID NO: 
20 ATGGATOACCCSEQ ID NO: 
OaATGACCCT(SEQ ID NO: 
AT8AC0CTGA(SEQ ID NO: 
aACXnGAOACSEQ ID NO« 
CCCTaAGAGA(SEQ ID NO: 
2S CTGAGAOAOAtSEQ ID NO: 
GAOACSAGAAOCSEQ ID NO: 
GAGAaAA(rr6(SBQ ID NO; 
GACAAGTGTTCSEQ ID MO: 
GAA0T(3TTAa(SEQ XD NO: 
30 AGT6TTAGAG(SEQ ID NO: 
TGTTAGAOTO(SEQ ID NO; 
TTAGAaTGGA(SEQ ZD NO: 
AGAGTGGAGGCSEQ ID NO: 
AGTGGAGGTT(SEQ ID NO: 
35 TQCSAGGnriTOiSEQ ID NO; 
GAGOTTTGAC(SEQ ZD NO: 
G6TTTGACAG(SEQ ID NO; 
TTTC5ACAGCC(SEQ ID NO: 
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480); GAACACCAGC(SBQ ID NO< 481); 
482); ACACCAGCnT(SBQ ID MO: 483); 
484); ACCAaCTTaT(SEQ ID NO: 485); 
486) ; CACCTTOTTA(SEQ ID NOt 487) ; 
488); GCTTGTTACA{SBQ ID NO: 489); 
490); TT(?rTACACC(SBQ ID WO: 491); 
493); GTTACACCCr(SaQ ID ND: 493); 
49f); TACACCCTGT(SBQ ZD NO: 495); 
49$) ; CACCCrOTGA(SBQ ZD NO: 497) ; 
498); CCCTG;TGAGC(SEQ ID MO: 499); 
500)/ CTOTCJAGCCT (SBQ ZD MO: SOI); 
502) ; GTGAGCCTaCCSBQ ID NO: 503) ; 
504); GAGCCTOCAT (SBQ ID NO: 505); 
506); aCCTaCATOO(SaQ ZD MO: 507); 
508) ; CTGCATCSaAA(SBQ ID NO: 509) ; 
510); GCATaGaiAT6(SSQ ZD NO: 511); 

513) ; ATGOAATaOACSBQ ID MO: 513); 

514) ; aGSAATCX3ATO(5BQ ID NO: 515) ; 
516) r AATQGATGAC(SBQ ZD MO: 517); 
S18); T(K3MraACCC(SaQ ZD MO: 519); 
520); GSATQA<XXrrG(SBQ ZD MO: 521); 
522) ; TGACCCT(3AS<SBQ ID NDi 523) ; 
524); ACCCXOAGMXsaQ ZD MOt 525); 
526); CCTaAaAGA6(SBQ ZD HO: 527); 
528); TGAOAOAOAACSBQ ZD MOt 529); 
530); AO A OAOAAiSTCSBQ ZD MO: 531); 
532); AGAOAAGTGTCSBQ ID MO: 533); 
534); AGAACrraTTA(SBQ ID NO: 53S) ; 
536) ; AAGTGTTAGACSBQ ID NOs $37) ; 
538); GTGTTAG2^T(SBQ ZD VOt 539); 
540); GTTAaAaTaO(SBQ 10 MO: $41); 
542); TA<3MnGGA0(SEQ XD NOi 543); 
544); GAOTOQAGGKSBQ ID MO: 545); 
S4fi) ; GTOQRa(iiTTT(SBQ ID NOt 547) ; 
548) ; CSGAOOTTTQACSBQ ID MOt 549) ; 
550); AGOTTTGAOKSBQ ZD MOt 551); 
552); GTTTGACAaC(SEQ ID MO: 553); 
554); TTGACAaCC:6(SEQ ID MO: 555); 



TGACAGCCGC (SEQ ID NO: 556); GACAGCCGCC (SEQ ID NO: 557);
ACAGCCGCCT (SEQ ID NO: 558); CAGCCGCCTA (SEQ ID NO: 559);
AGCCGCCTAG (SEQ ID NO: 560); GCCGCCTAGC (SEQ ID NO: 561);
CCGCCTAGCA (SEQ ID NO: 562); CGCCTAGCAT (SEQ ID NO: 563);
GCCTAGCATT (SEQ ID NO: 564); CCTAGCATTT (SEQ ID NO: 565);
CTAGCATTTC (SEQ ID NO: 566); TAGCATTTCA (SEQ ID NO: 567);
AGCATTTCAT (SEQ ID NO: 568); GCATTTCATC (SEQ ID NO: 569);
CATTTCATCA (SEQ ID NO: 570); ATTTCATCAC (SEQ ID NO: 571);
TTTCATCACG (SEQ ID NO: 572); TTCATCACGT (SEQ ID NO: 573);
TCATCACGTG (SEQ ID NO: 574); CATCACGTGG (SEQ ID NO: 575);
ATCACGTGGC (SEQ ID NO: 576); TCACGTGGCC (SEQ ID NO: 577);
CACGTGGCCC (SEQ ID NO: 578); ACGTGGCCCG (SEQ ID NO: 579);
CGTGGCCCGA (SEQ ID NO: 580); GTGGCCCGAG (SEQ ID NO: 581);
TGGCCCGAGA (SEQ ID NO: 582); GGCCCGAGAG (SEQ ID NO: 583);
GCCCGAGAGC (SEQ ID NO: 584); CCCGAGAGCT (SEQ ID NO: 585);
CCGAGAGCTG (SEQ ID NO: 586); CGAGAGCTGC (SEQ ID NO: 587);
GAGAGCTGCA (SEQ ID NO: 588); AGAGCTGCAT (SEQ ID NO: 589);
GAGCTGCATC (SEQ ID NO: 590); AGCTGCATCC (SEQ ID NO: 591);
GCTGCATCCG (SEQ ID NO: 592); CTGCATCCGG (SEQ ID NO: 593);
TGCATCCGGA (SEQ ID NO: 594); GCATCCGGAG (SEQ ID NO: 595);
CATCCGGAGT (SEQ ID NO: 596); ATCCGGAGTA (SEQ ID NO: 597);
TCCGGAGTAC (SEQ ID NO: 598); CCGGAGTACT (SEQ ID NO: 599);
CGGAGTACTT (SEQ ID NO: 600); GGAGTACTTC (SEQ ID NO: 601);
GAGTACTTCA (SEQ ID NO: 602); AGTACTTCAA (SEQ ID NO: 603);
GTACTTCAAG (SEQ ID NO: 604); TACTTCAAGA (SEQ ID NO: 605);
ACTTCAAGAA (SEQ ID NO: 606); CTTCAAGAAC (SEQ ID NO: 607);
TTCAAGAACT (SEQ ID NO: 608); TCAAGAACTG (SEQ ID NO: 609);
CAAGAACTGC (SEQ ID NO: 610); AAGAACTGCT (SEQ ID NO: 611);
AGAACTGCTG (SEQ ID NO: 612); GAACTGCTGA (SEQ ID NO: 613).

In a preferred embodiment the present invention contemplates a viral isolate which:
(i) is reactive to antibodies to a glycoprotein from HIV-1 such as at least one of
gp41-45, gp120 and/or gp160;
(ii) carries a deletion of at least ten nucleotides in a region corresponding to the
nef gene in HIV-1 NL4-3; and
wherein said deletion encompasses one or more decanucleotides from the nef gene of
HIV-1 NL4-3 or corresponding sequences from another pathogenic strain of HIV-1



defined in (or substantially analogous to) SEQ ID NOs: 803 to 841:

ACCAGCTTGT [SEQ ID NO: 803]   CCAGCTTGTT [SEQ ID NO: 804]
CAGCTTGTTA [SEQ ID NO: 805]   AGCTTGTTAC [SEQ ID NO: 806]
GCTTGTTACA [SEQ ID NO: 807]   CTTGTTACAC [SEQ ID NO: 808]
TTGTTACACC [SEQ ID NO: 809]   TGTTACACCC [SEQ ID NO: 810]
GTTACACCCT [SEQ ID NO: 811]   TTACACCCTG [SEQ ID NO: 812]
TACACCCTGT [SEQ ID NO: 813]   ACACCCTGTG [SEQ ID NO: 814]
CACCCTGTGA [SEQ ID NO: 815]   ACCCTGTGAG [SEQ ID NO: 816]
CCCTGTGAGC [SEQ ID NO: 817]   CCTGTGAGCC [SEQ ID NO: 818]
CTGTGAGCCT [SEQ ID NO: 819]   TGTGAGCCTG [SEQ ID NO: 820]
GTGAGCCTGC [SEQ ID NO: 821]   TGAGCCTGCA [SEQ ID NO: 822]
GAGCCTGCAT [SEQ ID NO: 823]   AGCCTGCATG [SEQ ID NO: 824]
GCCTGCATGG [SEQ ID NO: 825]   CCTGCATGGA [SEQ ID NO: 826]
CTGCATGGAA [SEQ ID NO: 827]   TGCATGGAAT [SEQ ID NO: 828]
GCATGGAATG [SEQ ID NO: 829]   CATGGAATGG [SEQ ID NO: 830]
ATGGAATGGA [SEQ ID NO: 831]   TGGAATGGAT [SEQ ID NO: 832]
GGAATGGATG [SEQ ID NO: 833]   GAATGGATGA [SEQ ID NO: 834]
AATGGATGAC [SEQ ID NO: 835]   ATGGATGACC [SEQ ID NO: 836]
TGGATGACCC [SEQ ID NO: 837]   GGATGACCCT [SEQ ID NO: 838]
GATGACCCTG [SEQ ID NO: 839]   ATGACCCTGA [SEQ ID NO: 840]
TGACCCTGAG [SEQ ID NO: 841]



Generally, the subject HIV-1 isolate is non-pathogenic as hereinbefore defined.
Additionally, reference herein to "a deletion" includes reference to a contiguous or
non-contiguous series of two or more deletions.

The non-pathogenic isolate may carry a single decanucleotide deletion or may carry
more than one decanucleotide deletion. Where it carries multiple deletions these may
all correspond to a contiguous sequence or be from different parts of the nef gene.
Furthermore, the terminal end portions of a deletion may lie within a decanucleotide
as defined above. It is emphasised that the present invention extends to analogous
sequences from other pathogenic strains of HIV-1 which might carry nef genes with a
slightly altered sequence relative to HIV-1 NL4-3.
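
By way of illustration only, the relationship between a deletion and the listed decanucleotides can be sketched in a few lines of Python. The sketch enumerates the overlapping ten-nucleotide windows of a reference stretch and reports which of them lie wholly inside a deleted interval. The function names, the 0-based inclusive offsets and the 20-nucleotide example stretch (taken from the start of the SEQ ID NO: 803 series) are assumptions made for this example and are not part of the specification.

```python
def decanucleotides(region: str, k: int = 10) -> list[str]:
    """Return every overlapping k-mer (default: decanucleotide) in `region`."""
    return [region[i:i + k] for i in range(len(region) - k + 1)]


def encompassed_windows(region: str, del_start: int, del_end: int, k: int = 10) -> list[str]:
    """List the k-mers that lie wholly inside the deleted interval.

    `del_start` and `del_end` are 0-based, inclusive offsets into `region`
    (an illustrative convention, not the numbering used in the specification).
    """
    return [
        region[i:i + k]
        for i in range(len(region) - k + 1)
        if del_start <= i and i + k - 1 <= del_end
    ]


# Illustrative 20-nucleotide stretch; a real check would use the full reference region.
region = "ACCAGCTTGTTACACCCTGT"

print(decanucleotides(region)[:3])          # first three sliding windows
print(encompassed_windows(region, 2, 12))   # windows covered by an 11-nt deletion
```

A deletion shorter than ten nucleotides returns an empty list, which matches the requirement that the deletion encompass at least one complete decanucleotide.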
In a most preferred embodiment of the present invention, there is provided a
non-pathogenic strain of HIV-1 carrying a nucleotide sequence in its genome as set
forth in SEQ ID NO: 614:
5 aAAGAGATTTO0SAaAACATGACCrGG»TGCAST(XX3AAAAA(»AA 
AAAATACATATACTCCTTACTTOUUIAATCGCA6AACCAA<^^ 
AACTATTGGAATTGGATCAATGGGCAAGTTTGTGGAAT^^ 
CTGrrGXrrATATAAAAATATTCATAAnjKnA^ 
TTTTOCTCTACTTTCTATAffrGSAATACS^^ 
10 AOACXTCTCCTCXXAACCCaSAGOaGACCXrGACAGGCCCGAAG 
GGAGAGACUlCauaiGASACAGATCCACT<X^ 
OOAOQACCTOAGGACKrCIXTrGCXTrCTTCCTCTACC^ 
TTGTAACAAGGATTQTCXSAACTTCTaOGACGCAOGKXSATGK^^ 
TGK3AACCTCCTAAAGTATTG6ACXX3«ySAACTGaU^^ 
IS TGCCACCSCCATACKACrrAGCTGAGOGCACS^ 

CTTATAGAGCTATCCTCCSlCATACXn'AGAAGUUa^^ 

GGXnX3GGGGKXAAC»ACTAAaUTGCrGATCX^ 

AOMWSOGKISTTTTCXAaTCAAACCTCMXITAaCI^ 
20 AAAAOOGNXnACrocaAAOaaCrAATTCACI^XSCM^^ 

AAACTATTAOC!AgrCWACHX3UXX3AAGHV^^ 

A rwriT C ' iV rA U GiGGACTTTCCATCagr^^ 

OACTAOTTCCOGTaaGOACTTTCCAAOAAGaCSaOG^^ 

G»GCCCTa^TCK:TGCATATAA0CMSC1XX:TTrcrr^^ 
25 AGACXAGATCIXSaGCCTGKXSAGCTCI^^ 

AATAAAGCTT<XX:TTQAgT(X:TTa^OTA(m>TOTXX^ 

ATCTAOA; 

and/or SEQ ID NO: 615:
30 GUUUlCAATTTOOOATAACATGAGCIXKUVTaaunOQK^^ 

AAACATAATATACavCXrrrA&TTOAJtflAATCXXawe^ 

AATTATT(X3AATTa6ATAAATCKX:CAAATriXnr^^ 

CrATGGTATATAARATTATTCATAATOQTA<naU3K?A ^ 

TTTTACTOTACTTTCTATAGTqftATA0AGTTA0QCA0GK3ATACTCA<^^ 
35 AGACCCACCTCCCAACCCCaftAOaaACCCaACJUXX: 

GaAaAaAaAaACAOA<XKAGCTCCACnKX3ATTAOTOCAG(m^ 

aOACXJA<XrrGJUSX3AGTCTGT0CCT^^ 
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TTOTAACGAOGATTGTOGAACITCTGf^ 

TGGJ^TCrrCCKK:AGTATTGGAGGCAC^^ 

TGOaiCGOC(»TAGGAOTAGCTGAGGGCAa^ 

CTTATAGAGCTATTCTaaCATACCTAGAAGAATAAGA 
5 CTATAAAATGGGTGGCAAXnXj^CAGAAAG^^ 

OACAAGG^GknATAGATCTTAGCCGC^^ 
AATTCACTCACAGAGAAGATCAG^^ 
AAAACAACAOATTtnTCamTOTTCOGTr^^ 
GATAAGCCGCTGOQGACTTTCCGAAGAGGCGT^^ 
1 0 TGGGCGGGACTGGGGAGTGGCGAGCr CTO^ 
CTGTACrGGGTCTCTCTGGTTAGACC^ 
GGAACCCACTOCTTAAGCCraulTAAAGCTTGCCT^^ 
CQTCTQTTGTQTQACTCTQaTATCTAQA * 

The present invention, however, extends to HIV-1 isolates which are non-pathogenic;
carry genomes capable of hybridising under low stringency conditions to SEQ ID NO:
614 or SEQ ID NO: 615; and which do not direct the synthesis of a full length nef gene
product.

In a further embodiment the present invention contemplates a viral isolate which:
(i) is reactive to antibodies to a glycoprotein from HIV-1 such as at least one of
gp41-45, gp120 and/or gp160;
(ii) carries a genome or a part or fragment thereof capable of hybridising under
medium stringency conditions to a nucleotide sequence as set forth in SEQ ID
NO: 1 or a complementary form thereof or an analogous sequence from another
pathogenic strain of HIV-1;
(iii) carries a deletion of at least ten nucleotides in a region corresponding to the LTR
region in HIV-1 NL4-3; and
wherein said deletion encompasses one or more of the following decanucleotides from
the LTR region of HIV-1 NL4-3 or corresponding sequences from another pathogenic
strain of HIV-1:

GCTTTTTGCC (SEQ ID MO: 6S2) ; CrrTTTGCCT (SBQ ID KOi «53) ; 
TTTTTCCCTO (SEQ ID NOi €54) ; TTTTOCCTGT (SBQ ID NO: 65S) ; 
35 TTTGCCTCfTA (SEQ ID MO: 656) / TTOCCTGTAC (SEQ ID MO: 657); 



TGCCTOTACT (SEQ ID NO: 658) / 
CCTGTACTOG (SEQ ID NO: 660) ; 
TOTACTGOGT (SEQ ID NO: 662) ; 
TACTOGGTCT (SEQ ID NO: 664); 
5 CTOGGTCTCT (SEQ ID NO: 666) ; 
GGOTCTCTCT (SEQ ID NO: 668) ; 
GTClCTCraa (SEQ id NO: 670) ; 
CTCTCTGOTT (SEQ ID NO: 672); 
CTCTGOTTAO (SEQ ID NO: 674) ; 
10 CTGGTTAOAC (SEQ ID NO: 676) ; 
GGTTAGACCA (SEQ ID NO: 678); 
TTAGACCAGA (SEQ ID NO: 680); 
AGACCAGATC (SEQ ID NO: 682) ; 
ACCAOATCTC (SEQ ID NO: 684) ; 
15 CAGATCTGAG (SEQ ID NO: 686) ; 
GATCTCSAGCC (SEQ ID NO: 688); 
TCTGAQCCTO (SEQ ID NO: 690) ; 
TOAOCCIGGG (SEQ ID NO: 692); 
AGCCTOOOAO (SEQ ID MO: 694); 
20 OCKWGAOCT (SEQ ID NO: 696); 
TOGGAGCTCT (SEQ ID NOt 698) ; 
OCSAGCTCTCT (SEQ ID NO: 700) ; 
AGCTCTCTOO (SEQ ID NO: 702) ; 
CTCTCTGOCr (SEQ ID NO: 704) ; 
25 CTCTGGCTAA (SEQ ID NO: 706); 
CTGOCTAACT (SEQ ID NOt 708) ; 
OOCTAACTAG (SEQ ID NOi 710) ; 
CTAACTACJOO (SEQ ID HO: 712) ; 
AACTAOGOAA (SEQ ID NOs 714) ; 
30 CTAGGGAACC (SEQ ID NOt 716) ; 
JUX3GAACCCA (SEQ ID MO: 718) ; 
OOAACCCACT (SEQ ID MO: 720) $ 
AACCCACTGC (SEQ ID NOt 722) ; 
CCCACTOCTT (SEQ ID NO: 724) ; 
35 CACroCTTAA (SEQ ID MO: 726) ; 
CTGCTTAAGC (SEQ ID NO: 728) ; 
OCTTAAGCCT (SEQ ID NO: 730) ; 
TTAAOCCTCA (SEQ ID NO: 732); 



GCCTGTACTG (SEQ ID MO: 659) ; 
CTGTACTGGO (SEQ ID NO: 661) ; 
GTACTGGGTC (SEQ ID NO: 663) ; 
ACTOGGTCTC (SEQ ID MO: 665) ; 
TGGOTCT<rrC (SEQ ID NO: 667); 
GOTCTCTCTO (SEQ ID MO: 669) ; 
TCTCTCTGGT (SEQ ID NO: 671) ; 
TCTCTGGTTA (SEQ ID NO: 673); 
TCTCTGGTTA (SEQ ID MO: 675) ; 
TOOTTAGACC (SEQ ID NO: 677) ; 
GTTAOACCAG (SEQ ID MO: 679) ; 
TAOACCAGAT (SEQ ID NO: 681) ; 
GACCAGATCT (SEQ ID NO: 683); 
CCAGATCTGA (SBQ ID MO: 68S) ; 
A6ATCTGAGC (SBQ ID MO: 687) ; 
ATCTGAGCCT (SBQ ID MO: 689) ; 
CTOAGCCTOG (SBQ ID NO: 691) ; 
GA0CCKK3GA (SBQ ID MO: 693); 
aCCTOGGAOC (SBQ ID MO: 695) ; 
CTOOCSAGCTC (SBQ ID MO: 697) ; 
OOQAOCTCrC (SBQ ID MO: 699) ; 
OAOCTCTCTG (SBQ ID MO: 701) ; 
OCTCtCTQGC (SBQ ID MO: 703); 
TCTCTGGCTA (SBQ ID MO: 705); 
TCTOOCTAAC (SBQ ID NO: 707); 
TOOCTAACTA (SBQ ID MOs 709) ; 
GCTAACTAOG (SBQ ID MO: 711) ; 
TAACTAOOGA (SEQ ID MOt 713) ; 
ACTAGQGAAC (SBQ ID MO: 715) ; 
TAOOOAACCC (SBQ ID MO: 717); 
OOGAACCCAC (SBQ ID MO: 719) » 

GAAOCCACTO (SEQ ID NO: 721) ; 

ACCCACTOCT (SBQ XD MOt 723) ; 

CX31CTGCTTA (SBQ ID MOs 72S) ; 

ACTGCTTAAO (SEQ ID MO: 727) ; 

TOCTTAAGCC (SBQ ID MO: 729) » 

CTTAAOCCTC (SEQ ID MOt 731) ; 

TAACCCTCAA (SBQ ID MO: 733) ; 
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AAGCCTCAAT (SEQ ID NO: 734); AOCCTCAATA (SEQ ID WO: 735); 
OCCTCAATAA (SEQ ID NO: 736} ; CCTCAATAAA (SEQ 10 NO: 737) ; 
CTCAATAAAG (SEQ ID NO: 738); TCAATAAA(3C (SEQ ID NO: 739); 
CAATAAACCT (SEQ ID NO: 740) ; AATAAAOCTT (SEQ ID NO; 741) ; 
5 ATAAAOCTTG (SEQ ID NO: 742); TAAAGCTTOC (SSQ ID NO: 743); 
AAAGCTTGCC (SEQ ID NO: 744); AAGCTTGCCT (SEQ ID NO: 745); 
AGCTTGCCTT (SEQ ID NO: 746); GCTTGCCTTG (SBQ ID NO: 747); 
CTTGCCTTOA (SEQ ID NO: 748) ; TTCCCTTGAG (SEQ ID NO; 749) ; 
T<SOCTTGAGT (SEQ ID NO: ISO); aCCTTGASTQ (SBQ ID MO: 751); 
10 CCTTGAOTOC (SEQ ID NO: 7S2) ; CTTGAGTOCT (SBQ ID NO: 753) ; 
TrGAGTGCIT (SEQ ID NO: 754); TCUOTOCTTC (SBQ TD MO: 755); 
CaurPGCTTCA (SEQ ID NO: 756); ACRTGCTTCAA (SEQ ID NO: 757); 
GSTaCTTC»AC (SEQ ID MO: 758); IXXTTTCAAGT (SEQ ID HOt 759); 
OCTTCAAGTA (SEQ ID NO: 7«0) ; CTTCAAGTAG (SBQ ID WO: 761); 
15 TTCAAtyTAGT (SEQ ID NOi 762); TGAROTACrro (SBQ ID WO: 763); 
CAAGTAOTOT (SBQ ID WO: 764); AAOTACTCTC (SBQ ID MO: 765); 
AGTAOTGTOT (SEQ ID WO: 766); GTAGTGTGTG (SBQ ID WOt 767); 
TAGTGTOTOC (SBQ ID NO: 768) ; AOXXnGTOOC (SBQ ID WDs 769) ; 
OTOTOTGCCC (SBQ ID NO: 770); TOTfiT0C«<X» (SSQ ID WO: 771); 
20 GTOTaCCCXST (SBQ ID NO: 772); TGTGCCCX3TC (SBQ 10 WO: 773); 
GTGCCCOTCT (SEQ ID NO: 774) ; TOCCCXSTCIG (SBQ ID WO: 775) ; 
GCCOGTCTOT (SBQ ID WO: 776); CC03TCTGTT (SBQ ID WO: 777); 
CCOTCreTTO (SBQ id NO: 778); COTCrBTTOT (SBQ ID WO: 779); 
GTCTOTTOTO (SEQ ID WO: 780) ; TCT0TTGT6T (SBQ ID MO: 781) ; 
25 CTOTTOTGTG (SEQ ID WO: 782) ; T(JTTOTOTX5A (SBQ ID WO: 783) ; 
GTTOTOTOAC (SEQ ID MO: 784) / TTGTCW3ACT (SBQ ID MO: 785) ; 
TGTGTGACTC (SBQ ID NO: 786) ; CfTGTCACTCT (SBQ ID MO: 787) ; 
TOTOTXSACTC (SEQ ID NO: 788) ; OTOTGACTCT (SBQ ID NO: 789) ; 
T0TGACTOX5 (SEQ ID NO: 790) » OTQACTCrOQ (SBQ ID MO: 791) ; 
30 TGACTCTGGT (SEQ ID HO: 792) ; GACTCTOOTA (SBQ ID MO: 793) ; 
ACrrCTGGTAA (SBQ ID NO: 794); CTCTGGTAAC (SBQ ID HO: 795); 
TCTOGTAA<rr (SEQ ID NO: 796) ; CTOOTAACTA (SEQ ID NO: 797) ; 
TCXSTAACTAO (SBQ ID HO: 798) ; 0<3TAACTAGA (SBQ 10 NO: 799) . 

The non-pathogenic isolate may carry a single decanucleotide deletion
or may carry multiple deletions in the same region or in the LTR region and another
region such as the nef gene. In particular, the mutation may be in the LTR/nef overlap
region. Where it carries multiple deletions, these may correspond to a contiguous
sequence or be from different parts of the LTR region and/or nef gene. Furthermore,
the terminal end portions of a deletion may lie within a decanucleotide as defined above.

Yet another aspect of the present invention provides an infectious molecular clone
comprising genetic sequences derived from the non-pathogenic HIV-1 isolates as
hereinbefore described and includes genetic sequences encoding major structural proteins
such as gag, env and pol. Infectious molecular clones are particularly useful as genetic
compositions capable of "infecting" host cells without need of viral coat. The infectious
molecular clones of the present invention may also be derived from pathogenic HIV-1
strains rendered non-pathogenic as herein described.

According to this latter embodiment, there is contemplated a method of attenuating a
pathogenic strain of HIV-1, said method comprising inducing a mutation in the nef gene
and/or an LTR region to generate a non-pathogenic HIV-1 strain as hereinbefore
described. Preferred mutations are deletions of at least ten nucleotides such as one or
more of the decanucleotides as hereinbefore described. Particularly preferred mutations
result in modified Nef protein carrying a deletion substantially corresponding to amino
acids 162 to 177 of Nef from wild-type HIV-1. The mutations may also constitute
substitutions and/or insertions of heterologous nucleic acid molecules in the nef and/or
LTR regions such as the incorporation of a sense (i.e. co-suppression) or antisense
molecule.
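
As an illustrative aside, an amino acid range such as 162 to 177 can be converted into nucleotide coordinates with simple codon arithmetic. The sketch below assumes, purely for the sake of the example, that the nef open reading frame begins at nucleotide 8787 of the reference numbering; the specification names 8787 only as the lower bound of a region, so this start position and the helper name `codon_span` are hypothetical.

```python
# ASSUMPTION: first nucleotide of the nef start codon in the reference numbering.
NEF_START_NT = 8787


def codon_span(first_aa: int, last_aa: int, orf_start: int = NEF_START_NT) -> tuple[int, int]:
    """Return the inclusive nucleotide range encoding residues first_aa..last_aa.

    Residues are numbered from 1; each residue occupies three nucleotides.
    """
    start = orf_start + 3 * (first_aa - 1)
    end = orf_start + 3 * last_aa - 1
    return start, end


start, end = codon_span(162, 177)
print(start, end, end - start + 1)  # range and its length in nucleotides (16 codons = 48 nt)
```

Under the stated assumption the range works out to nucleotides 9270 to 9317. A different start position for the open reading frame shifts both ends by the same amount.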

In still yet another embodiment of the present invention, there is provided an isolated,
non-pathogenic strain of HIV-1 comprising a deletion in its genome of at least 10
nucleotides within the region of nucleotide 8787 and 9709 and more particularly within
the region 8800 and 9700 and even more preferably within the region 8800 and 9410,
using the nucleotide numbering of HIV-1 NL4-3.
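
The nested regions recited above lend themselves to a small worked check. The following Python sketch, offered only as an illustration, reports the narrowest of the three recited regions that wholly contains a deletion of at least 10 nucleotides. The function name, the region labels and the treatment of coordinates as inclusive are assumptions made for this example.

```python
# Regions recited in the text, ordered from narrowest to broadest.
REGIONS = [
    ("8800-9410", 8800, 9410),
    ("8800-9700", 8800, 9700),
    ("8787-9709", 8787, 9709),
]

MIN_DELETION_NT = 10


def narrowest_region(del_start: int, del_end: int):
    """Return the label of the narrowest recited region containing the deletion.

    Coordinates are treated as inclusive. Returns None if the deletion is
    shorter than 10 nucleotides or falls outside every recited region.
    """
    if del_end - del_start + 1 < MIN_DELETION_NT:
        return None
    for label, lo, hi in REGIONS:
        if lo <= del_start and del_end <= hi:
            return label
    return None


print(narrowest_region(9281, 9371))  # a deletion inside the most preferred region
print(narrowest_region(9500, 9600))  # inside the intermediate region only
print(narrowest_region(9389, 9395))  # only 7 nucleotides long, so None
```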
In one embodiment, the deletion of at least 10 nucleotides is from within a region
selected from the list consisting of (using the nucleotide numbers of HIV-1 NL4-3):
nucleotide (i) 8830-8862;
(ii) 900^9035;
(iii) 9019-9029; and
(iv) 9033-9049.

In another embodiment, the deletion of at least 10 nucleotides is from within a region
selected from the list consisting of (using the nucleotide numbers of HIV-1 NL4-3):
nucleotide (v) 9281-9371;
(vi) 9281-9362;
(vii) 9105-9224; and
(viii) 9271-9370.

In yet another embodiment, the deletion of at least 10 nucleotides is from within a
region selected from the list consisting of (using the nucleotide numbers of HIV-1 NL4-3):
nucleotide (ix) 8882-8928;
(x) 8850-9006;
(xi) 8792-9041; and
(xii) 9112-9204.

In still yet another embodiment, the deletion of at least 10 nucleotides is from within a
region selected from the list consisting of (using the nucleotide numbers of HIV-1 NL4-3):
nucleotide (xiii) 9105-9224;
(xiv) 9389-9395; and
(xv) 9281-9366.
The above embodiments include any combinations thereof or functional equivalents
thereof. Most preferred variants of HIV-1 are defined in Table 3.
Reference herein to the non-pathogenic HIV-1 strains of the present invention includes
reference to components, parts, fragments and derivatives thereof including both genetic
and non-genetic material. Furthermore, the non-pathogenic HIV-1 strains may be in
isolated form or resident in peripheral blood mononuclear cells (PBMCs) or other like
cells where the genome of the HIV-1 strains is integrated as DNA from said HIV-1
strains such as proviral DNA. In addition, the present invention extends to recombinant
virus such as from (or resident in) prokaryotes or eukaryotes as well as in the form of
infectious molecular clones.

Accordingly, the present invention provides for the non-pathogenic HIV-1 isolate,
genomic material therefrom, complementary proviral DNA, molecular infectious clones,
recombinant viral particles or genetic sequences therefrom or cells expressing same or
blood cells carrying proviral DNA or to any mutants, derivatives, components,
fragments, parts, homologues or analogues of the foregoing.

Another aspect of the present invention contemplates a synthetic peptide comprising a
sequence of amino acids as defined in SEQ ID NO:699 or a part or a fragment thereof.
The synthetic peptide generally comprises at least four, preferably at least five, more
preferably at least six and even more preferably at least seven or more of the amino
acids as defined in SEQ ID NO:699. Furthermore, the synthetic peptides may comprise
chemically modified amino acids or structurally, functionally and/or stereochemically
equivalent substitute amino acids. Such synthetic peptides are useful in diagnostic
protocols or in therapeutic regimens, such as in generating antibodies to that particular
region of the Nef protein. By administering the synthetic peptides of this aspect of the
present invention, higher titres of antibodies to a particular region of Nef may be
obtained compared to what would be stimulated if a larger molecule were administered.

The non-pathogenic HIV-1 strains of the present invention are particularly useful in the
development of therapeutic compositions, therapeutic molecules and/or diagnostic
reagents. With regard to the former, the non-pathogenic HIV-1 strain may be
considered as a live attenuated vaccine where individuals carrying DNA derived from
said non-pathogenic HIV-1 strain such as proviral DNA in target cells are protected from
infection by a corresponding pathogenic strain. The term "vaccine" is used in its broadest
sense as a therapeutic composition or molecule which prevents or reduces HIV-1
infection or risk of infection or which ameliorates the symptoms of infection. It may
involve the stimulation of an immune response or may involve blocking HIV-1 cell
receptors and/or the use of genetic compositions, for example, to introduce ribozymes
or antisense molecules to HIV-1 directed genetic sequences or to prepare infectious
molecular clones. For convenience, all such compositions are referred to hereinafter as
"therapeutic compositions".

Accordingly, the present invention contemplates a method for inhibiting or reducing the
risk of infection by a pathogenic strain of HIV-1, said method comprising administering
to a subject a non-pathogenic HIV-1 as hereinbefore defined in an amount effective to
infect target cells and to generate target cells carrying proviral DNA from said
non-pathogenic HIV-1. More particularly, the present invention contemplates a method for
inhibiting or reducing productive infection of an individual by a pathogenic strain of
HIV-1, said method comprising administering to a subject a non-pathogenic isolate of
HIV-1 in an amount effective to infect target cells and to generate target cells carrying
proviral DNA from said non-pathogenic HIV-1. By "productive infection" as used in
the specification and claims herein is meant the infection of a cell or cells by a
pathogenic strain of HIV-1 which leads ultimately to the symptomology of AIDS or
AIDS related diseases. A cell infected productively produces pathogenic virions. By
definition, infection of an individual by a non-pathogenic strain of HIV-1 would not lead
to productive infection. Non-pathogenic HIV-1 strains generally replicate to a sufficient
extent to protect cells against challenge by virulent or pathogenic strains. The methods
of the present invention are applicable prophylactically (i.e. to prevent de novo infection)
or therapeutically (i.e. to reduce or slow disease progression).

The present invention further provides a method for vaccinating an individual against
the development of AIDS or AIDS-related diseases, said method comprising
administering to said individual a non-pathogenic isolate of HIV-1 in an amount
effective to infect target cells and to generate target cells carrying proviral DNA from
said non-pathogenic HIV-1. The term "vaccinating" should not be taken as limiting the
invention to the prevention of HIV-1 infection by solely immunological means. The
term "vaccinating" includes any means of preventing productive infection of an
individual by pathogenic HIV-1. Particularly preferred non-pathogenic strains of HIV-1
according to these aspects of the present invention are generally defined as encoding a
modified protein such as a modified Nef protein and more particularly one
which is substantially non-interactive to antibodies to amino acids 162 to 177 of
wild-type HIV-1 Nef.

As an alternative to the above methods, a therapeutic composition as hereinbefore
defined is administered. The non-pathogenic isolate may be administered inter alia as
an isolated viral preparation or via infected blood cells. Another aspect of the invention
provides a therapeutic composition for inhibiting or reducing the risk of infection by a
pathogenic strain of HIV-1, said therapeutic composition comprising a non-pathogenic
strain of HIV-1 or genetic sequences derived therefrom as hereinbefore described and
optionally one or more pharmaceutically acceptable carriers and/or diluents. In a further
embodiment, the therapeutic compositions comprise the synthetic peptides comprising
an amino acid sequence as set forth in SEQ ID NO:699 or a part, fragment or
homologue thereof. Such a vaccine would generate high titre antibodies to a [...]
region of Nef protein.

The therapeutic composition of the present invention is generally suitable for
intravenous, intraperitoneal, intramuscular, intramucosal (e.g. nasal spray, respiratory
spray) or other forms of parenteral administration. The therapeutic composition may
also be administered via an implant or rectally or orally. In addition to the mutations
contemplated above, the non-pathogenic HIV-1 strain may also contain one or more
other mutations to further reduce the risk of reversion to virulence and/or to insert a
genetic sequence capable of providing directly or indirectly an [...]
further anti-HIV-1 properties and/or immunostimulatory or cell regulatory properties.




For example, the non-pathogenic HIV-1 isolate in the therapeutic composition may
comprise additional genetic material capable of directing expression of antisense
nucleotide sequences to inhibit expression of one or more proteins encoded by a
pathogenic strain of HIV-1. Alternatively, sense co-suppression may be employed.
Preferred sense or antisense molecules would reduce expression of the nef gene or affect
normal functioning of the LTR region. In a particularly preferred embodiment, the
nucleotide sequence encoding amino acids 162 to 177 is targeted by sense or antisense
molecules.
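
By way of illustration only, the relationship between a sense-strand target and an
antisense molecule can be sketched in Python as follows. The function name and the
example fragment are hypothetical and are not taken from the specification or its
sequence listing; the sketch shows base complementarity only and says nothing about
the design of an effective antisense construct.

```python
# Illustrative sketch: derive the antisense RNA for a sense-strand DNA fragment.
# The example fragment used below is hypothetical, not a sequence from the text.

COMPLEMENT = {"A": "U", "C": "G", "G": "C", "T": "A"}


def antisense_rna(sense_dna: str) -> str:
    """Return the antisense RNA (5'->3') complementary to a sense DNA strand (5'->3')."""
    sense_dna = sense_dna.upper()
    unknown = set(sense_dna) - set(COMPLEMENT)
    if unknown:
        raise ValueError(f"unexpected characters: {sorted(unknown)}")
    # Complement each base while walking the strand backwards, so that the
    # result is written 5'->3'.
    return "".join(COMPLEMENT[base] for base in reversed(sense_dna))


if __name__ == "__main__":
    print(antisense_rna("GATTACA"))
```

An antisense RNA of this kind is complementary to the corresponding region of the mRNA.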



According to this embodiment, the non-pathogenic HIV-1 strain may be considered as
a targeting agent to introduce genetic constructs capable of reducing expression of one
or more HIV-1 proteins or polypeptides. In this embodiment there is provided a viral
isolate which:

(i) is genetically or immunologically related to a pathogenic strain of HIV-1;
(ii) is substantially non-pathogenic in human subjects;
(iii) comprises a first nucleotide sequence constituting its genome which is capable
of hybridising under medium stringency conditions to SEQ ID NO: 1 or a
complementary form thereof; and
(iv) comprises a second nucleotide sequence within said first nucleotide sequence and
which second nucleotide sequence directs expression of a mRNA molecule
capable of inhibiting, reducing or otherwise down-regulating translation of a
protein or polypeptide encoded by a pathogenic strain of HIV-1 or inhibit, reduce
or otherwise down regulate operation of a non-protein encoding region of a
pathogenic strain of HIV-1.

Preferred proteins or polypeptides targeted for reduced expression are those encoded by
one or more of the following: gag, pol, env, tat, rev, vpu, vpr, vif and/or nef genes. The
most preferred protein or polypeptide targeted for reduced expression is the product of
the nef gene. Alternatively, the target nucleotide sequence does not encode a
polypeptide or protein but is required for other functions such as integration or excision
from the human genome or expression of genes on the viral genome. An example of
such a nucleotide sequence is the LTR region. Accordingly, the present invention
extends to disruption to the function of such regions.

In a particularly preferred embodiment there is provided a viral isolate which:

(i) is genetically or immunologically related to a pathogenic strain of HIV-1;
(ii) is substantially non-pathogenic in human subjects; and
(iii) comprises a nucleotide sequence which directs expression of a mRNA molecule
capable of inhibiting, reducing or otherwise down-regulating translation of Nef.

Preferably, the nucleotide sequence reduces levels of amino acids 162 to 177 in Nef.
The above aspect relates to use of antisense technology. The present invention extends,
however, to use of ribozymes and/or co-suppression to achieve [...]. In an
alternative embodiment, or in addition to, the second (or optionally a third) nucleotide
sequence encodes an antiviral agent (e.g. interferon) and/or an immune enhancing agent.


The identification of deletions [...] in the nef gene and/or LTR region in
asymptomatic subjects provides a unique [...] effects of
attenuated HIV-1 strains carrying one or more [...]. In
particular, the present invention provides a means for designing [...]
directed to inhibiting expression of a nef gene and/or LTR [...]
strain (such as contemplated above) as well as developing a therapeutic regimen aimed
at inhibiting the activity of the nef gene product or the function of the LTR region.
According to this latter embodiment, the present invention provides a therapeutic
composition comprising a molecule capable of inhibiting the intracellular activity of the
nef gene product and/or LTR region, said composition further comprising one or more
pharmaceutically acceptable carriers and/or diluents. In one preferred embodiment, the
inhibition affects amino acids 162 to 177 of Nef.



The molecule contemplated by the above aspect of the subject invention may be a
protein, polypeptide, peptide, chemical compound, sugar moiety or derivative of the nef
gene product. The molecule will need to be capable of entering an infected cell. Where
the molecule is a protein, polypeptide or peptide, it may be encoded by the second
nucleotide sequence on the targeting vector as contemplated above. Alternatively, the
molecule may be a nucleic acid molecule capable of targeting the nef [...]
region. Where the nef gene is targeted, the preferred region is the nucleotide sequence
which encodes amino acids 162 to 177 of Nef.


The deletion mutants of the present invention may result in a modified nef gene product
either having no readily discernible activity or having activity different from the naturally
occurring nef protein. In any event, if a mutant nef gene product is produced, it will
generally have a lower molecular weight than the naturally occurring nef protein and will
have a different overall amino acid sequence. Importantly, it will be immunologically
distinguishable from wild-type Nef in that it will be substantially non-interactive with
antibodies to a particular region of Nef, such as amino acids 162 to 177 of wild-type
Nef. This provides, therefore, for a means for diagnosing individuals with [...]
infection by, for example, assaying for a modified nef protein or screening for a
modified nef gene sequence. Alternatively, benign HIV-1 infection may be detected by
assaying for a modified LTR region such as an altered nucleotide [...]

These aforementioned aspects of the present invention apply to screening deletion,
truncation or other mutants of HIV-1-derived proteins where such mutations result in a
strain of HIV-1 being substantially non-pathogenic. Although a variety of procedures
are available to detect a modified HIV-1-derived protein, a particularly convenient
approach is to screen HIV-1 infected individuals for the absence of antibodies to the
deleted or truncated portion of a target protein.

According to one embodiment there is provided a method for determining the
pathogenicity of a strain of HIV-1 after said HIV-1 strain infects cells of an individual,
said method comprising contacting a biological sample from said individual with a
peptide corresponding to a deleted or truncated region of an HIV-1-derived protein and
screening for the absence of antibody binding to said peptide, wherein the absence of
antibody binding is indicative of a deletion or truncation in that protein and further
indicative of the non-pathogenicity of said strain of HIV-1.

Although this general methodol[...] which
is critical for pathogenicity in a host, it is particularly useful in screening deletions in
the Nef protein. An exemplary deletion in the Nef protein is all or part of the sequence
of amino acids 162 to 177 of Nef. Preferably, the assay would include a positive
control such that where antibodies are present to Nef, such antibodies would bind to this
positive control. When a Nef protein carries a deletion at or about amino acids 162 to
177, antibodies to this region would be absent in an individual infected by a
non-pathogenic strain of HIV-1 and no binding would be detected to a pe[...]
region.


According to a particularly preferred embodiment, there is contemplated a method for
determining the pathogenicity of an HIV-1 strain after said strain infects cells of an
individual, said method comprising contacting a biological sample from said individual
with an effective amount of a peptide having an amino acid sequence comprising or
within amino acids 162 to 177 of wild-type HIV-1 NL43 [...] contact being for a
time and under conditions sufficient for an antibody if present in said biological sample
to form a complex with said peptide and then detecting the presence of said complex
wherein the absence of a complex in an individual [...] is indicative
of the individual being infected with a non-pathogenic [...]. The method
may also comprise contacting the biological sample with one or more peptides derived
from other regions of Nef such as flanking regions to amino acids 162 to 177 where
antibodies to such other regions may be expected even in non-pathogenic strains.
This would provide a suitable positive control. A biological sample according to this
embodiment would be any source of antibodies such as serum and whole blood.
Preferably, the peptides are immobilized to a solid support as des[...]. This
method of the present invention is also applicable for determining the risk of an
individual seropositive for HIV-1 developing symptoms of AIDS. Again, substantial
absence of antibodies to amino acids 162 to 177 of Nef in an HIV-1 seropositive
individual would be indicative of a low risk of an individual developing AIDS.
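
By way of illustration only, the qualitative read-out of such a peptide-binding assay,
including the positive control, might be summarised as in the following Python sketch.
The function name, argument names and result strings are hypothetical; the sketch
encodes only the decision logic described above and is not a validated diagnostic rule.

```python
# Illustrative sketch of the read-out logic for the peptide-binding assay.
# All names are hypothetical.


def interpret_nef_peptide_assay(binds_target_peptide: bool,
                                binds_control_peptide: bool) -> str:
    """Classify a sample from an HIV-1 seropositive individual.

    binds_target_peptide:  antibody complex detected with the Nef 162-177 peptide
    binds_control_peptide: antibody complex detected with a peptide from another
                           region of Nef, used as the positive control
    """
    if not binds_control_peptide:
        # With no Nef antibodies detected in the control, absence of binding
        # to the target peptide cannot be interpreted.
        return "inconclusive"
    if binds_target_peptide:
        return "antibodies to Nef 162-177 present"
    return "consistent with a non-pathogenic strain"


if __name__ == "__main__":
    print(interpret_nef_peptide_assay(False, True))
```

The control branch is tested first because a negative result for the 162 to 177 peptide
is only informative when antibodies to other regions of Nef are demonstrably present.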

Other methods of screening for the pathogenicity or otherwise of strains of HIV-1 will
readily become apparent as a result of the present invention. For example, antibodies
may be first generated to modified Nef proteins from non-pathogenic strains of HIV-1
where such antibodies would not recognise wild-type Nef protein.

According to one embodiment, there is provided a method for determining the
pathogenicity of an HIV-1 strain after said HIV-1 strain infects cells of an individual,
said method comprising contacting a biological sample from said individual with an
effective amount of an antibody specific to a nef protein from a non-pathogenic strain
of HIV-1 (as hereinbefore defined) for a time and under conditions sufficient to form
an antibody-modified protein complex and then detecting said complex. The
presence of said complex is indicative of a modified gene product and of the
non-pathogenicity of the strain of HIV-1. Alternatively, the isolated Nef could be tested
against a panel of antibodies where certain antibodies are specific to a deleted region.
The absence of antibody binding would be indicative of a deletion and, therefore, a
modified Nef protein and in turn a putative non-pathogenic form of the virus. The
biological sample is a sample likely to contain the modified nef gene product such as
tissue extract or cell extract of an infected cell. However, where the modified nef gene
product is capable of permeation or transport out of the cell, suitable biological samples
would include serum, whole blood, lymph and mucosal secretion amongst other fluids.

Many variations in the aforementioned assays are possible and are contemplated herein.
For example, an assay could also be based on the inability of a specific antibody
to bind to a modified nef protein. For the purposes of the present invention the term
"contacting" includes "mixing".

The presence of a modified nef molecule, such as a molecule carrying a deletion, or
other suitable HIV-1 derived protein in biological fluid can be detected using a wide
range of immunoassay techniques such as those described in US Patent Nos. 4,016,043,
4,424,279 and 4,018,653. These include both single-site and two-site, or "sandwich",
assays of the non-competitive types, as well as the traditional competitive binding
assays, and include ELISA and RIA techniques. Sandwich assays are among the most
useful and commonly used assays and are favoured for use in the present invention. A
number of variations of the sandwich assay technique exist, and all are intended to be
encompassed by the present invention. Briefly, and by way of example only, in a
typical forward assay, a modified nef product-specific antibody is immobilised onto a
solid substrate to form a first complex and the sample to be tested for modified nef
product brought into contact with the bound molecule. After a suitable period of
incubation, for a period of time sufficient to allow formation of an antibody-modified
nef product secondary complex, a second modified nef protein antibody, labelled with
a reporter molecule capable of producing a detectable signal, is then added and
incubated, allowing time sufficient for the formation of a tertiary complex of
antibody-modified nef product-antibody. Any unreacted material is washed away, and the
presence of the tertiary complex is determined by observation of a signal produced by
the reporter molecule. The results may either be qualitative, by simple observation of
the visible signal, or may be quantitated by comparison with a control sample containing
known amounts of hapten. Variations of the forward assay include a simultaneous assay,
in which both sample and labelled antibody are added simultaneously to the bound
antibody, or a reverse assay in which the labelled antibody and sample to be tested are
first combined, incubated and then added simultaneously to the bound antibody. These
techniques are well known to those skilled in the art, and the possibility of minor
variations will be readily apparent. The antibodies used above may be monoclonal or
polyclonal. In a preferred embodiment, antibodies to Nef in an individual's biological
fluid are screened for using peptides derived from Nef. The absence of antibodies to
specific regions of Nef such as amino acids 162 to 177 would be indicative of a
non-pathogenic strain of HIV-1. All such assays and variations of such assays are
encompassed by the present invention.

The solid substrate is typically [...] or a polymer, the [...]
being cellulose, polyacrylamide, nylon, polystyrene, polyvinyl chloride or polypropylene.
The solid supports may be in the form of tubes, beads [...]
surface suitable for conducting an immunoassay. The binding processes are well-known
in the art and generally consist of cross-linking, covalently binding or physically
adsorbing the molecule to the insoluble carrier.

By "reporter molecule", as used in the present specification, is meant a molecule which,
by its chemical nature, produces an analytically identifiable signal which allows the
detection of antigen-bound antibody. Detection may be either qualitative or quantitative.
The most commonly used reporter molecules in this type of assay are enzymes,
fluorophores or radionuclide containing molecules (i.e. radioisotopes). In the case of an
enzyme immunoassay, an enzyme is conjugated to the second antibody, generally by
means of glutaraldehyde or periodate. As will be readily recognised, however, a wide
variety of different conjugation techniques exist which are readily available to one
skilled in the art. Commonly used enzymes include horseradish peroxidase, glucose
oxidase, β-galactosidase and alkaline phosphatase, amongst others. The substrates to be
used with the specific enzymes are generally chosen for the production, upon hydrolysis
by the corresponding enzyme, of a detectable colour change. It is also possible to
employ fluorogenic substrates, which yield a fluorescent product.

Alternatively, fluorescent compounds, such as fluorescein and rhodamine, may be
chemically coupled to antibodies without altering their binding capacity. When activated
by illumination with light of a particular wavelength, the fluorochrome-labelled antibody
absorbs the light energy, inducing a state of excitability in the molecule, followed by
emission of the light at a characteristic colour visually detectable with a light
microscope. As in the EIA, the fluorescent labelled antibody is allowed to bind to the
first antibody-hapten complex. After washing off the unbound reagent, the remaining
complex is then exposed to the light of the appropriate wavelength; the fluorescence
observed indicates the presence of the hapten of interest. Immunofluorescence and EIA
techniques are both very well established in the art and are particularly preferred for the
present method. However, other reporter molecules, such as radioisotope,
chemiluminescent or bioluminescent molecules, may also be employed. It will be
readily apparent to the skilled technician how to vary the procedure to suit the required
purpose. It will also be apparent that the foregoing can be used to [...]
product and to use same directly in the detection of, for example, circulatory antibodies
specific to said modified nef product.




Alternatively, genetic assays may be conducted to screen for aberrations in the nef gene
and/or LTR region. Such a genetic assay may be by Southern or Northern blot analysis,
PCR analysis or the like using oligonucleotides specific to a deleted region of a gene
and/or LTR region.


According to this embodiment there is provided a method for determining the
pathogenicity of an HIV-1 strain after said HIV-1 strain infects cells of an individual,
said method comprising determining directly or indirectly the presence of a deletion
mutation in the genome of said HIV-1 wherein the presence of such a mutation is
indicative of the presence of a non-pathogenic strain of HIV-1. The deletion mutation
may result in the genome being unable to synthesize a polypeptide or protein from a
pathogenic strain of HIV-1 or may direct the synthesis of a truncated or deleted form
of said polypeptide or protein, for example, a Nef protein with amino acids 162 to 177
deleted therefrom. The mutation may also lead to altered expression of a polypeptide
detectable by, for example, decreased synthesis of a particular protein, such as the nef
gene product. Alternatively, the deletion mutation affects the LTR region or a
regulatory region of the HIV-1 genome. In either case, affected viruses may also be
detected by, for example, observing low viral copy numbers such as low viral loads.

Preferably, said non-pathogenic HIV-1 carries a deletion in its genome of at least 10
nucleotides within the region from nucleotide 8787 to nucleotide 9709 using the
nucleotide numbering of HIV-1 NL43.



Preferably, said non-pathogenic HIV-1 carries a deletion in its genome of at least 10
nucleotides from within a region selected from the list consisting of:
nucleotide (i) 8830-8862;
           (ii) 9009-9035;
           (iii) 9019-9029; and
           (iv) 9033-9049.

Preferably, said non-pathogenic HIV-1 carries a deletion in its genome of at least 10
nucleotides from within a region selected from the list consisting of:
nucleotide (v) 9281-9371;
           (vi) 92«l-9362;
           (vii) 9105-9224; and
           (viii) 9271-9370.

Preferably, said non-pathogenic HIV-1 carries a deletion in its genome of at least 10
nucleotides from within a region selected from the list consisting of:
nucleotide (ix) 8882-8928;
           (x) 8850-9006;
           (xi) 8792-9041; and
           (xii) 9112-9204.


Preferably, said non-pathogenic HIV-1 carries a deletion in its genome of at least 10
nucleotides from within a region selected from the list consisting of:
nucleotide (xiii) 9105-9224;
           (xiv) 9389-9395; and
           (xv) 9281-9366.

The above nucleotide numbers are based on the nucleotide numbering in the NL43
genome.
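
By way of illustration only, a deletion mapped in NL43 coordinates could be compared
against the ranges listed above with a short Python sketch such as the following. All
function and variable names are hypothetical. Item (vi) is omitted because its first
coordinate is not legible in the source, and treating both end coordinates as inclusive
is an assumption.

```python
# Illustrative sketch only. Coordinates follow the NL43 numbering used in the
# text; all names are hypothetical and both end coordinates are assumed inclusive.

BROAD_REGION = (8787, 9709)   # region from nucleotide 8787 to nucleotide 9709
MIN_DELETION = 10             # "at least 10 nucleotides"

# Ranges as listed in the text. Item (vi) is left out because its first
# coordinate is not legible in the source.
LISTED_REGIONS = {
    "i": (8830, 8862), "ii": (9009, 9035), "iii": (9019, 9029), "iv": (9033, 9049),
    "v": (9281, 9371), "vii": (9105, 9224), "viii": (9271, 9370),
    "ix": (8882, 8928), "x": (8850, 9006), "xi": (8792, 9041), "xii": (9112, 9204),
    "xiii": (9105, 9224), "xiv": (9389, 9395), "xv": (9281, 9366),
}


def deletion_length(start: int, end: int) -> int:
    """Number of nucleotides removed, counting both end positions."""
    return end - start + 1


def qualifies(start: int, end: int) -> bool:
    """True if the deletion is at least 10 nt long and lies inside 8787-9709."""
    low, high = BROAD_REGION
    return (deletion_length(start, end) >= MIN_DELETION
            and low <= start and end <= high)


def overlapping_regions(start: int, end: int) -> list:
    """Labels of listed ranges sharing at least one position with the deletion."""
    return [label for label, (low, high) in LISTED_REGIONS.items()
            if start <= high and end >= low]


if __name__ == "__main__":
    print(qualifies(9019, 9029), overlapping_regions(9019, 9029))
```

The sketch reports overlap rather than containment for the listed ranges, since some of
them are themselves shorter than 10 nucleotides.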

Particularly preferred oligonucleotides are based on the deleted regions of the nef gene
and/or LTR region such as but not limited to one or more oligonucleotides based on
SEQ ID NO: 2 to SEQ ID NO: 613 and/or SEQ ID NO: 652 to SEQ ID NO: 799.

Most preferred deletions include deletions of one or more of SEQ ID NO:803 to 841
which cover amino acids 162 to 177 of Nef. A particularly preferred genetic assay
screens for this deletion.

The present invention further extends to kits for the diagnosis of infection by pathogenic
strains of HIV-1 or for determining the pathogenicity of infecting virus. The kits would
be in compartmental form each comprising one or more suitable reagents for conducting
the assay.

The present invention is further described by the following non-limiting Examples.

EXAMPLE 1
Source Material

For the purposes of the following examples, non-pathogenic HIV-1 strains were isolated
from recipients of HIV-1 infected blood. The recipients are designated "C18", "C54",
"C98", "C49", "C64" and "C124". The donor is identified herein as "D36". The place
of isolation may be indicated after the abbreviation of "HIV". For example, St Vincents
Hospital, Sydney (HIVstv) or Macfarlane Burnet Centre of Medical Research,
Melbourne (HIV[...]).

Exemplary viral isolates referred to herein as "C18" and "C98" were deposited at the
PHLS Centre for Applied Microbiology and Research, European Collection of Animal
Cell Cultures (ECACC), Division of Biologics, Porton Down, Salisbury, Wiltshire SP4
0JG. C18 was deposited on 17 October, 1994 under Provisional Accession Number
V94101706 and C98 was deposited on 31 October, 1994 under Provisional Accession
Number V941031169. Another isolate "C54" was deposited at ECACC on 10 March,
1995 under Provisional Accession Number V95031022.

Figure U is a summary of the deletion mutants of the present invention.
Viruses are isolatable by the following procedures:

1. Infected peripheral blood mononuclear cells (PBMCs) were co-cultured with
HIV-1 seronegative donor PBMCs. A convenient source of seronegative donor
PBMCs is a blood bank. The supernatants and cells are harvested every 7 days
and fresh medium added with CD8 depleted PBMCs. CD8 depletion promotes
the ability to isolate HIV-1. The culture and procedure is continued for up to
approximately 5 weeks;

2. The infected PBMCs are purified from whole blood and these cells are cultured
alone for up to 5 weeks. PBMCs alone are used because the virus is more likely
to be monocytotropic. Fresh medium is added weekly and supernatant fluid is
harvested at this time;

3. Supernatant fluids are harvested every approximately 7 days; fresh medium and
fresh HIV-1 seronegative CD8 depleted PBMCs are added at this time;

4. HIV-1 seronegative PBMCs are pretreated with M-CSF for approximately
72 hours prior to the addition of infected PBMCs. M-CSF has been shown to
enhance HIV-1 replication in monocytes (Gendelman et al, 1988); and

5. The supernatant fluid is harvested from the cultures of step 4 every
approximately 7 days, fresh medium added together with HIV-1 seronegative
[...] CD8 depleted PBMCs. The virus is isolated from the infected
PBMCs.

A particularly preferred method of isolation is as follows:

HIV negative donor PBMC were stimulated by culture in RPMI 1640 containing 10%
v/v fetal calf serum (FCS), 15 mM N-2-hydroxyethylpiperazine-N'-2-ethanesulfonic acid,
0.1% w/v sodium bicarbonate with 100 IU/ml penicillin and 100 µg/ml streptomycin
with the addition of 10 µg/ml PHA (Wellcome, Temple Hill, Dartford, England) for 72
h prior to co-culture. Fresh patient cells (10 x 10[...] cells) were then co-cultured with the
PHA-activated donor PBMC (10 x 10[...] cells). Immediately on co-culture 2 x 10[...] of the
mixed cell population were UV irradiated (254 nm, 300 µW/cm², 15 sec), added back to
the remaining cells and cultured for 29 days. After UV treatment cells were resuspended
at 1 x 10[...] cells/ml in RPMI 1640 containing 10% v/v FCS, 15 mM HEPES, 0.1% w/v
sodium bicarbonate, 25 µg/ml glutamine, 100 IU/ml penicillin, [...]
2 µg/ml polybrene, 4 µg/ml hydrocortisone (Sigma, St Louis, MO, USA), 20 U/ml
interleukin 2 (Boehringer Mannheim, Germany) and 120 nU/ml anti-interferon (ICN
Biochemicals, Costa Mesa, CA, USA). Cells were maintained by half [...]
every 3 to 4 days after PHA stimulation, with addition of fresh stimulated donor
PBMC on days 7, 14 and 21. Virus production was assayed for by cell-free reverse
transcriptase activity (Neate et al, 1987) or p24 activity (Abbott Diagnostics assay).


EXAMPLE 2
DNA Preparation and PCR Amplification
Non-pathogenic HIV-1 (e.g. strain C18) infected peripheral blood mononuclear cells
(PBMC) were harvested 4 days after infection of phytohaemagglutinin (PHA) stimulated
HIV-1 negative donor PBMC cultured by the method of Neate et al (1987) and [...]
in phosphate buffered saline (PBS). PBMC from Donor D36 and Recipients C18, C54
and C98 were prepared by Ficoll/Isopaque centrifugation of buffy coat cells and washed
with PBS.

Approximately 10[...] cells were lysed in 1 ml lysis solution (0.45% v/v NP40, 0.45% v/v
Tween 20, 10 mM Tris-HCl pH 8.3, 40 mM KCl, 2.5 mM MgCl2) and digested with
[...]0 µg/ml proteinase K (Boehringer Mannheim) at 55°C for 1 hour followed by 100°C
for 10 minutes. Lysates were stored at -20°C.

All polymerase chain reaction (PCR) primers (Table 1) and sequencing primers (Table
2) were synthesized using an Applied Biosystems model 391 DNA synthesizer using
phosphoramidite chemistry.

Strict physical separation was maintained for sample, PCR reagent mix and PCR reaction
preparations as well as amplification and analysis. Final reaction mixes (50 µl) contained
2 µl neat or diluted cell lysate, 0.2 µM each primer, 20[...] 1.25 units Taq
polymerase (Boehringer Mannheim) in PCR buffer (10 mM Tris-HCl pH 8.3, 50 mM KCl,
100 µg/ml gelatine) adjusted to the optimum MgCl2 concentration for the primer pair
(1.5-3.0 mM). Aliquoted reagent mix was overlaid with 50 µl mineral oil prior to
addition of DNA template lysate. After template denaturation at 94°C for 3 min
amplification was achieved with 30 cycles of 94°C, 1 min; 55°C, 1 min; 72°C, 2 mins.
A final elongation reaction was conducted at 72°C for 7 minutes. For double PCR
amplification 2 µl of first round product was added to the second reagent mix and
amplified as before.
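
By way of illustration only, the programmed duration of the thermal profile described
above can be tallied with the following Python sketch. The names are hypothetical,
ramp times between temperatures are ignored, and the extension step is taken to be at
72°C to match the final elongation step, which is an assumption.

```python
# Illustrative sketch: programmed time of the thermal profile described above.
# Ramp times between temperatures are ignored; all names are hypothetical.

INITIAL_DENATURATION_MIN = 3      # 94 C, 3 min
CYCLE_STEPS_MIN = (1, 1, 2)       # 94 C, 55 C and extension (assumed 72 C)
CYCLES = 30
FINAL_ELONGATION_MIN = 7          # 72 C, 7 min


def programmed_minutes(rounds: int = 1) -> int:
    """Total programmed minutes for one or more rounds of amplification."""
    per_round = (INITIAL_DENATURATION_MIN
                 + CYCLES * sum(CYCLE_STEPS_MIN)
                 + FINAL_ELONGATION_MIN)
    return rounds * per_round


if __name__ == "__main__":
    print(programmed_minutes(), programmed_minutes(rounds=2))
```

Passing `rounds=2` corresponds to double PCR amplification, in which the second round
is run as before.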

PCR amplified DNA was checked for quality, quantity and fragment [...]
electrophoresis in Tris-Acetate-EDTA buffer (Sambrook et al, 1989), stained in ethidium
bromide and viewed by UV transillumination.

EXAMPLE 3
DNA Sequence Analysis
The DNA sequence of PCR amplified HIV-1 regions was determined by the
dideoxynucleotide method (Sanger et al, 1977) using Sequenase T7 polymerase (United
States Biochemicals).

PCR amplified DNA was purified by PCR Magic prep resin chromatography (Promega).

Approximately 2 to 7 µg purified DNA plus 10 ng specific primer (Table 2) were
denatured by boiling for 3 mins and snap frozen to -20°C. The initial labelling reaction
was for 3 minutes at 22°C (room temperature) with 35S-dNTP (500 Ci/mmol; Dupont)
followed by dideoxynucleotide termination reactions at 37°C for 5 minutes. NP40, to
0.45% v/v, was included in denaturation and reaction mixes (Bachmann et al, 1990).
Sequencing reaction products were denatured in formamide and resolved on a 6% w/v
polyacrylamide gel containing 8M urea, fixed in 10% v/v acetic acid, 10% v/v methanol
and dried. Following autoradiography on XK1 film (Kodak) the gel sequences were
read, assembled, translated to protein and aligned using the PC/GENE [...]
(IntelliGenetics, USA).

TABLE 1
PCR PRIMERS

PRIMER     SEQUENCE(1,3)                          POSITION(2)
Cl-1       TOGAAGGGCTAATTTCK}T (616)              1-18
Cl-2       ATCTTaXTAAAAAATTAGCCrOTC (617)         20W-207S
LTR-3'     AGGCTCAGATCTGGTC7AAC (618)             955j^-y54(;
SK68       AGCAGCAGGAAOCACTATGO (619)             7786-7805
[...]      TGCTAOAOATnTCCACAC (620)               9709-9691
KS-2       AGTOAATAOA<nTAOOCAG<3 (621)            8326-8345
RT5'-v3    gtaagacagtatgatx:aoata (622)           2418-2438
RT3'-v2    TTOTAOGGAATTCCAAATKXXfiM)              4660-4640
RT5'-v2    CAGOATCXTACAOCTOTCAACATAATC^)          2487-2506
RT3'-v1    GGGAATTCCTrATTCCTGCrrG (625)           4655-4634

1. Sequence is presented from 5' to 3' of the primer.
2. Positions according to the numbering of HIV-1 in Myers et al (1992).
3. SEQ ID NOs are given in parentheses.
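
By way of illustration only, the position column of such a primer table can be read with
the following Python sketch, which derives primer length and strand from a pair of
coordinates. The function name is hypothetical. It is an assumption, not stated in the
table, that both coordinates are inclusive and that a first coordinate greater than the
second denotes a primer on the complementary strand.

```python
# Illustrative sketch: primer length and strand from a pair of coordinates.
# Assumes inclusive coordinates; a descending pair is read as the complementary
# (antisense) strand. The function name is hypothetical.

from typing import Tuple


def primer_span(first: int, last: int) -> Tuple[str, int]:
    """Return (strand, length in nucleotides) for a primer position pair."""
    strand = "sense" if first <= last else "antisense"
    return strand, abs(last - first) + 1


if __name__ == "__main__":
    # Position pairs that are legible in the table.
    for pair in [(1, 18), (7786, 7805), (9709, 9691)]:
        print(pair, primer_span(*pair))
```

A length computed this way can be checked against the number of bases printed for the
same primer.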

TABLE 2
SEQUENCING PRIMERS

PRIMER     SEQUENCE(1,3)                    POSITION(2)
KS3        CCAOAAGTTOCACAATCC (626)         [...]
KS4        TTCTTCTAOGTATOTaGAG (627)        8753-8735
KS5        AOTOAATTAaCCCTTCCAG (628)        9093-9075
KS6        TGCTAGAOATTTTOCACAC (629)        9709-9691
SP2        TOCTCTGOAAAACTCAT (630)          8006-8022
SP3        CTTTCTATAOTOAATAOAO (631)        8318-8336
SP4        TATTOGAGTCAGGAACT (632)          8618-8634
SPR        OOTCTAACCAOAOAOAC (633)          9547-9531

1. Sequence is presented from 5' to 3' of the primer.
2. Positions according to the numbering of HIV-1 in Myers et al (1992).
3. SEQ ID NOs are given in parentheses.

EXAMPLE 4
Cells and Cell Culture

Peripheral blood was obtained from HIV-1 sero-negative volunteers and mononuclear
cells prepared by centrifugation on a Ficoll/Hypaque [...].
PBMC were activated with phytohemagglutinin (PHA; 10 µg/10[...] cells) for 48 h at 37°C,
washed and then cultured in RPMI 1640 medium containing 10% v/v heat inactivated
foetal calf serum, 15 mM HEPES, 0.1% v/v sodium bicarbonate, 25 µg/ml polybrene
(Sigma), 10% v/v interleukin 2 (Boehringer Mannheim) and 1:1000 anti-interferon
(Miles) (IL-2 medium). Non-PHA stimulated cells were prepared in a similar manner
except they were cultured in medium lacking PHA and IL-2.



EXAMPLE 5
Antipeptide-antisera

Antibodies specific for HIV-1 Nef were raised against a peptide corresponding to the
predicted amino acid residues 15-27 (AVRERMRRAEPAA SEQ ID NO: 634) of Nef
encoded by the HIV-1 clone pNL43 (Kemp et al, 1988). The peptide was conjugated
to keyhole limpet hemocyanin (KLH; Calbiochem, Behring Diagnostics, CA) via
glutaraldehyde and this complex used to immunise sheep (0.5 mg peptide
conjugate/sheep). Antibodies to the peptide were purified by affinity chromatography.
Reactivity of the antibodies with recombinant HIV-1 Nef 25 and 27 was demonstrated
by immunoblotting.

EXAMPLE 6
Reactivity of anti-Nef(15-27) with HIV C18-Infected Cells in Immunoblotting

Seven days post-infection HIV-1 C18 infected PBMCs and mock-infected cells were
washed in PBS then lysed (0.5% w/v NP-40, 0.5% w/v sodium deoxycholate, 50 mM
NaCl, 25 mM Tris-HCl, 10 mM EDTA, 0.01% w/v sodium azide and 10 mM
phenylmethylsulphonylfluoride). After nuclei were [...] were
electrophoresed in a 13% SDS-polyacrylamide gel (SDS-PAGE) and subsequently
transferred to Hybond-C nitrocellulose (Amersham, Buckinghamshire, England) for 1
h at 100 V using a Bio-Rad protein transfer cell (Bio-Rad, Richmond, CA). [...]
were pre-incubated with 1% w/v BSA/PBS for 2 h at room temperature and then reacted
with affinity purified sheep anti-Nef(15-27), diluted 1:100, overnight [...].
After three washes in 1% w/v BSA/PBS, the blots were incubated with donkey anti-
sheep Ig conjugated to biotin (Amersham, diluted 1:[...]) for 1 h at room temperature.
After extensive washing as described above the membranes were incubated with
streptavidin-conjugated horse radish peroxidase (Amersham; diluted 1:500) for 1 h at
room temperature. All dilutions were made with 1% w/v BSA in PBS. After further
washing the membrane was developed with phenylenediamine substrate (Dako,
Dakopatts, Denmark). The antibody preparation used in the immunoblotting [...]
was free of detectable antibodies to the immunogenic carrier protein and coupling
reagent.
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EXAMPLE 7 
Analysis by Pofymerase Chain Reaction Ampt^eation 
A y fiaement defined by primers Q- 1 and 0-2 conttining the 5' LTR ind pat of tbe 
gag gene was ampBfied. DNA from HTV-l Cl« infected PBMC gave an amplified 
5 fiagment (amplimer) of about 1 .9 kb compared with 2. Ikb for pHXB2 control template, 
implying a deletion of about 200bp fiom fflV C18. Further amplification of this 
fiagment with primers defining the U3 region of the LTO (CM and LTR-SO gave 
amplimeis of about 300 bp for HIV-1 Ct8 infected PBMC DNA comp«ed with 340bp 
for CU and D36 PBMC DNA and 4«4 bp for pHXB2 control. TWf hnplies the loss of 
10 ^^proximately 140 to l«Obp fiom the U3 region of these pcoviial DNAs. 
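
The deletion sizes quoted above follow from subtracting amplimer lengths. The short Python sketch below repeats that arithmetic. It assumes the approximate sizes given in the text, with a 484 bp control amplimer for the U3 region (the value consistent with the stated loss of 140 to 180bp); the function and variable names are illustrative only and are not part of the described method.

```python
def estimated_deletion(control_bp, sample_bp):
    """Return the approximate deletion size implied by two amplimer lengths."""
    return control_bp - sample_bp

# Approximate amplimer sizes (bp) as quoted in the text.
five_prime = estimated_deletion(2100, 1900)  # pHXB2 control vs HIV-1 C18, 5' fragment
u3_small = estimated_deletion(484, 340)      # U3 region, 340 bp amplimer
u3_large = estimated_deletion(484, 300)      # U3 region, 300 bp amplimer

print(five_prime, u3_small, u3_large)
```

Because gel-based size estimates are approximate, the results are best read as ranges rather than exact deletion lengths.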

To analyse the nef gene-3' LTR region, the nested primer pairs SK68-Cl-6 and KS-2-LTR-3' were used in a double PCR. Amplimers of approximately 830bp were obtained for HIV-1 C18 infected PBMC DNA as well as for PBMC DNA from Donor D36 and Recipients C18, C54 and C98 compared with approximately 1230bp for pHXB2 DNA. These results suggest that about 400bp of DNA have been deleted from the Donor and Recipient proviral DNAs.

In comparison, amplification of the polymerase gene region by double PCR with the nested primer pairs RT5'-v3-RT3'-v2 and RT5'-v2-RT3'-v1 gave a fragment (approximately 2.1 kb) the same size as the molecular clone pHXB2 fragment for HIV-1 C18 infected PBMC DNA, suggesting that deletions from this region were unlikely.



EXAMPLE 8

Nucleotide Sequence of the nef-3' LTR Region

PCR amplification experiments indicated an approximately 200bp nucleotide deletion from both the nef gene and LTR regions of Donor D36 PBMC and Recipient C18 HIV-1 proviral DNA. To further analyse these regions, the DNA sequence was determined for the PCR amplified nef-3' LTR region of D36 PBMC, C18 isolates HIV StV and HIV MBC as well as isolate C98 HIV infected PBMC proviral DNA. The 3' region was amplified with outer primers (SK68-Cl6) and inner primers (SK68-LTR 3' or KS2-Cl6) and sequenced directly using a number of internal sequencing primers based on the HIV-1 NL43 nucleotide sequence (Table 2).

Alignment of the nucleotide sequences of the amplified 3' region of donor D36 PBMC and recipient C18 isolates HIV StV and HIV MBC and C98 HIV (Fig 1) showed a number of nucleotide sequence changes, including deletions, relative to the nucleotide sequence of wild-type infectious HIV-1 (HIV-1 NL43). In this alignment D36 PBMC lacked 291 nucleotides, C18 HIV StV differed in size by 388 nucleotides (comprising deletions of 397 nucleotides and an insertion of 9 nucleotides), C18 HIV MBC differed by 456 nucleotides and C98 HIV lacked 158 nucleotides compared with HIV-1 NL43. The overall identity with HIV-1 NL43 nucleotide sequence of D36 PBMC, C18 HIV StV, HIV MBC and C98 HIV nucleotide sequences, including deletions, was 73% (1157/1596), 67% (1459/1592), 62% (982/1592) and 79% (1105/1399), respectively.

The D36 PBMC sequence differed from HIV-1 NL43 in a number of features. A change in the wild type tat termination codon from TAG to TCG (Ser) extended the third tat exon (which starts at splice acceptor 10) by a further 15 amino acids to terminate at a conserved TAG (Fig 1). The resulting C-terminal peptide is rich in charged amino acids (8/15) (Fig 2a). The wild type rev termination codon has also changed (TAG to GAG, Glu) and the third rev exon is extended for 14 codons to terminate at a conserved TAG (Fig 2b). The encoded extra amino acids are mainly polar (11/14) and charged in nature (Fig 2b). The sequence also encodes the C-terminal 237 amino acids of Env gp41 (Fig 3) terminating at the normal termination codon. The D36 PBMC Env amino acid sequence has 85% identity with the HIV-1 NL43 sequence, increasing to 89% if similarities are included.
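
The way a point mutation in a termination codon extends a reading frame to the next in-frame stop can be sketched in a few lines of Python. The sequences below are hypothetical toy examples and are not taken from any of the isolates; only the TAG to TCG (Ser) change mirrors the one described for tat.

```python
STOPS = {"TAG", "TAA", "TGA"}

def codons_to_next_stop(seq, start=0):
    """Count in-frame codons from `start` up to (not including) the next stop codon.

    Returns None if no in-frame stop codon is found.
    """
    for i in range(start, len(seq) - 2, 3):
        if seq[i:i + 3] in STOPS:
            return (i - start) // 3
    return None

# Hypothetical toy ORF tail: two sense codons, the original TAG stop,
# three further sense codons and a downstream in-frame TAG.
wild_type = "GATCCT" + "TAG" + "GAAGGAAAG" + "TAG"
# Same sequence with the first stop changed from TAG to TCG (Ser).
mutant = "GATCCT" + "TCG" + "GAAGGAAAG" + "TAG"

wt_len = codons_to_next_stop(wild_type)
mut_len = codons_to_next_stop(mutant)
print(wt_len, mut_len, mut_len - wt_len)
```

In this toy case the mutated stop codon and the three codons after it are read through, so the open reading frame gains four codons before the downstream TAG is reached.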

There are significant differences from HIV-1 NL43 downstream of the env (gp41) gene. A change in the fifth nef codon, from TGG (Trp) to TGA (Fig 1), introduces an early termination in the D36 PBMC nef gene. The encoded Nef protein is identical to the N-terminal 4 amino acids of HIV-1 NL43 Nef (Fig 4). Following the early termination there are deletions of 33, 47, 93 and 91 nucleotides and a region of low sequence homology, compared with HIV-1 NL43, prior to the wild type nef termination codon site (HIV-1 NL43 nts 9405-9407). As well as removing a significant part of the nef gene, these deletions also bring into phase a further 6 termination codons. While the polypurine tract (plus strand primer binding site) and the first 38 nucleotides of the LTR U3 region are perfectly conserved, downstream U3 region sequences are disrupted by the 93 and 91 nucleotide deletions and the low homology region. The resulting U3 region lacks recognition sequences for the transcription factors c-myb, USF and TCF-1α as well as one of the suggested NF-AT sites (Gaynor et al, 1992). Downstream from the 91 nucleotide deletion, a 59 nucleotide region of low homology contains two extra NFκB enhancer sites 19 nucleotides upstream of the usual site of a pair of NFκB sites, the upstream one of which is altered in its 5'-half in D36. Sequences further downstream are highly conserved with respect to HIV-1 NL43, including the position and number of Sp1 basal promoter sites, TATA box, TAR and polyadenylation signal sequences.

Similar to D36 PBMC, the C18 HIV StV and HIV MBC sequences show the tat third exon to be extended by 15 codons. All but two codons (altered by point mutations) are identical to those of D36 PBMC (Fig 2a). The rev third exon of both C18 isolates is also extended (Fig 2b) but by only three codons, identical to the first three codons of the D36 PBMC rev extension. The same 237 amino acid coding region of Env gp41 is found in both the C18 HIV DNA sequences (Fig 3) and shows 85% identity, increasing to 88% if similarities are included, with the same region of the HIV-1 NL43 Env gp41.

It is in the nef gene and LTR regions that the major differences from wild-type HIV-1 arise, just as in D36 PBMC. The nef gene of C18 HIV StV encodes 24 amino acids with 9 of the 10 N-terminal being identical to the HIV-1 NL43 Nef protein. Thereafter, deletions of 177 and 11 nucleotides cause a frameshift and termination at the 25th codon (Fig 1). Downstream deletions of 120, 82 and 7 nucleotides cause further loss of wild-type nef gene sequence and bring into phase a further three termination codons.

The nef gene of C18 HIV MBC encodes only 7 amino acids with only the initiator methionine identical to the HIV-1 NL43 Nef. This loss of identity and early termination is brought about by a 250 nucleotide deletion after the fifth nucleotide of the nef gene. Downstream deletions of 120 and 86 nucleotides cause further loss of wild-type nef gene sequences. In both C18 isolates there is perfect conservation of the polypurine tract and 29/31 nucleotides at the 5' end of the U3 region immediately before the 120 nucleotide deletion (Fig 1). This deletion together with the downstream 82 and 7 nucleotide deletions in HIV StV and 86 nucleotide deletion in HIV MBC and the low homology region cause the loss of the 5' half of the NRT-1 site (Yamamoto et al 1992) and the downstream NFAT site. A third NFκB site is present 31 (HIV StV) and 33 (HIV MBC) nucleotides upstream of the expected pair of NFκB sites which are themselves separated by 13 nucleotides instead of 4. The 5'-most Sp1 site sequence is slightly altered but sequences downstream including the other 2 Sp1 sites, the TATA box, TAR and polyadenylation signal sequences are identical to HIV-1 NL43 sequence.
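
The individual deletion sizes given for the two C18 sequences can be checked against the totals quoted from the alignment. The Python sketch below is a bookkeeping aid only. The labels "C18 HIV StV" and "C18 HIV MBC" and the helper names are illustrative, and the numbers are those stated in the text.

```python
# Deletion sizes (nucleotides) quoted in the text for each C18 sequence.
deletions = {
    "C18 HIV StV": [177, 11, 120, 82, 7],
    "C18 HIV MBC": [250, 120, 86],
}

# Total deleted nucleotides per sequence.
totals = {name: sum(sizes) for name, sizes in deletions.items()}

# The StV sequence also carries a 9 nucleotide insertion, so its net
# size difference is the deleted total minus the inserted total.
stv_net_difference = totals["C18 HIV StV"] - 9

print(totals)
print(stv_net_difference)
```

The totals of 397 and 456 nucleotides, and the net difference of 388 nucleotides for the first sequence, agree with the figures quoted from the alignment.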

The three sequences, D36 PBMC, C18 HIV StV and C18 HIV MBC, have a number of similarities consistent with the transmission of virus from person D36 to person C18 as well as a number of differences indicating post-transmission divergence of sequence. All three have tat open reading frames (ORFs) extended by 15 codons. All three have extended rev ORFs. The new rev termination codon in both C18 HIV-1 isolates, three codons downstream of the HIV-1 NL43 rev termination codon, has a point mutation in D36 PBMC to make a Glu codon so that it continues for a further 11 codons (Fig 2b) to terminate at a conserved TGA. The partial Env gp41 amino acid sequences are more closely related to each other (86% identity or 90% including similarities) than to HIV-1 NL43 (85% and 89%, respectively).

The nucleotide sequence of the nef and LTR region of the HIV-1 isolate from recipient C98 (C98 HIV) is 90.3% identical (1264/1399) to the HIV-1 NL43 sequence, ignoring deletions. Similar to the D36 PBMC and C18 HIV StV and HIV MBC isolates the C98 HIV sequence shows the third exon of tat to be extended by 15 codons [...] being identical to the D36 PBMC tat extension. Also, the rev gene is extended by 3 codons, 2 of which are identical to the first 2 codons of the D36 PBMC rev extension. The sequence also encodes the C-terminal 223 amino acids of Env gp41 terminating at the normal termination codon. The C98 HIV Env amino acid sequence has 89% identity with HIV-1 NL43 Env sequence, increasing to 92% if similarities are included.




As with the D36 PBMC and the C18 HIV isolate sequences it is in the nef gene and LTR regions that major differences from the HIV-1 NL43 sequence arise. The nef gene open reading frame of C98 HIV is much longer than in D36 PBMC, C18 HIV StV and HIV MBC, encoding 85 amino acids compared with 206 amino acids for HIV-1 NL43. Sixty eight of those 85 amino acids are identical to the N-terminal sequence of HIV-1 NL43 Nef. The single, small deletion (16 nucleotides) in the C98 HIV nef-alone region (Table 3) occurs after nef codon 82 causing a frameshift and termination after 3 codons at the start of the highly conserved polypurine tract sequence immediately before the 3'-LTR. The nef/LTR region has two deletions totalling 142 nucleotides. The 5'-most deletion of 42 nucleotides includes the splice acceptor 12 sequence. [...] However, the downstream 100 nucleotide deletion includes sequences from the 3' end of the 5'-NF-AT and all of the 3' NF-AT sequences as well as the USF transcription factor recognition sequence. The downstream low homology region of 77 nucleotides lacks the TCF-1α sequence but has two additional NFκB sites 13 nucleotides apart and 26 nucleotides upstream of the 3'-half remnant of the normal 5'-NFκB site. Sequence downstream, including the 3'-NFκB site, the 3 Sp1 sites, TATA box, TAR and polyadenylation signal sequences are all highly conserved.



The main feature of the sequences is the series of deletions, with respect to HIV-1 NL43, in the nef gene-3'-LTR region. These can be grouped into two regions namely the nef-alone region, that part of the nef gene upstream of the LTR, and the nef/LTR region, where the nef gene and LTR U3 regions overlap. The deletions in these regions of each of the sequences start and end at the same or similar positions (Table 3). The deletions are larger in C18 HIV StV and C18 HIV MBC, where a total of 397 and 456 nucleotides have been deleted (relative to HIV-1 NL43), compared with 291 nucleotides from D36 and 158 nucleotides from C98 HIV. In the nef-alone region the two deletions in C18 HIV StV and the deletion in C18 HIV MBC occupy the same region as the deletions in D36 PBMC. Similarly, in the nef/LTR region the three deletions in the C18 HIV StV, the two deletions in the C18 HIV MBC and D36 PBMC sequences occupy the same region. These findings indicate that mutant virus was transmitted from D36 to C18 after which further deletions and rearrangements occurred. Similarly, the sequence of C98 HIV in the nef/LTR region indicates two deletions occupying the same region as the deletions in D36 and the C18 sequences. However, the size (only 16 nucleotides) and the position of the deletion in the nef-alone region of C98 HIV are distinct from those of the D36 PBMC and C18 sequences.

The timing of transmission of virus by transfusion to C18 was approximately 19 months after C98. Consistent with the relative timing of transmission and the sequence similarities and differences is the suggestion that at the time of transmission to C98, the D36 sequence had deletions in the nef/LTR region but not in the nef-alone region. After transmission to C98 the virus developed further deletions and rearrangement including the deletion in the nef-alone region. The D36 virus evolved so that at the time of transmission to C18, further deletions and rearrangements had occurred including deletion of sequence from the nef-alone region distinct from the C98 HIV nef-alone region deletion. After transmission to C18, further deletions and rearrangements occurred in the C18 virus giving rise to at least two sequences (HIV StV and HIV MBC).

The nef-alone deletion region may be a mutation or recombination "hotspot" as it includes sequences that were found to be variably duplicated in sequences derived from 8 of 12 patients analysed in a study [...]. The sequence between the nef-alone and the nef/LTR region deletions is highly conserved and is important in provirus integration into the [...] a number of cellular proteins. It is interesting that the sequence equivalent to HIV-1 NL43 nucleotides 9209 to 9225 is retained in D36 and C98 HIV but not in the C18 HIV sequences. This includes part of a sequence of dyad symmetry (9210 to 9231) and is a significant part of the binding site for NRT-1 (Yamamoto et al 1991) which has been shown to have a negative regulatory effect on HIV [...]. The presence of this sequence in D36 and C98 HIV and its absence from the C18 isolates may [...] with the inability to isolate virus from D36 PBMC [...] but the ability to isolate HIV-1 from C18 PBMC. The deletion of sequence equivalent to nucleotides 9281 to 9395 of HIV-1 NL43 causes the loss of some transcription factor binding sites including NFAT and USF from the D36, C18 HIV and C98 HIV sequences.

A further similarity between the D36, C18 HIV StV, C18 HIV MBC and C98 HIV sequences is a region of low homology to HIV-1 NL43 extending downstream of the nef/LTR deleted region to the NFκB enhancer/Sp1 promoter site region. This low homology region in fact consists of incomplete duplications of part of the NFκB/Sp1 region (Fig 5) resulting in D36 and C98 HIV having 2 extra NFκB sites upstream of an altered 5' NFκB site while the C18 sequences have one extra NFκB site and altered spacing between the 5' and 3' wild type NFκB sites due to an insertion of 9 nucleotides.

For the C18 and C98 HIV-1 isolates virus replication was assessed in PHA-stimulated and non-stimulated PBMCs (Figs 6 and 7). In PHA-stimulated PBMCs we also studied cell surface CD4 and IL-2R expression (Fig 8). In comparison with HIV-1 wild-type SI and NSI isolates, clearly both C18 HIV and C98 viruses are replication competent, though C98 HIV replicates more poorly than C18 HIV, and are of the NSI phenotype when syncytium formation and CD4 and IL-2R surface expression are taken into account. Additionally, and more surprisingly, these two viruses replicated almost as efficiently in non-PHA stimulated PBMCs compared to a typical local wild type SI isolate (HIV-1 228200, Fig 7).

When protein expression was assessed for C18 HIV and C98 HIV, although structural proteins were identified, no typical Nef protein was seen in infected cells. However, analysis of cell lysates prepared from PBMC infected with C18 HIV or PBMC infected with C98 HIV (which were subsequently stimulated by UV irradiation, see Valerie et al, 1988) by Western immunoblotting using two antibodies specific for the N-terminal region of Nef showed the presence of small proteins of [...] kDa and 21 kDa, respectively. These proteins were not observed in mock-infected control PBMC lysates and were not observed when the infected cell lysates were probed with antibodies reactive with the C-terminus of Nef.

Thus, although the C18 and C98 HIV isolates are replication competent in vitro they clearly replicate differently using different conditions for cell activation, and the known functions of HIV-1 Nef protein and the LTR show that the mutation of the nef gene and/or the LTR is at least in part responsible for the outcome of infection, implicating the importance of Nef and/or the LTR in the clinical outcome of infection in vivo.

EXAMPLE 9

Determination of Degree of Relatedness Between Viruses

To determine the degree of relatedness between viruses such as between mutants or between mutants and a wild-type virus and to ascertain putative infected patients, the method of Delwart et al was employed.

EXAMPLE 10 

Immune responsiveness of subjects infected by non-pathogenic HIV-1 isolate

In this example, the donor and recipients of the cohort were tissue typed and assessed for basic cellular immune responses. Proliferative responses and IL-2 production to the mitogens ConA and PHA, to allogeneic mononuclear cells (irradiated pooled mononuclear cells from 20 random donors) and to recall antigens (eg influenza and tetanus toxoid) were within normal ranges. At the immunogenetic level, HLA typing failed to identify a consistently common allele or haplotype within the group.

The conservation of CD4+ counts observed in the cohort, the relative integrity of their immune system and the varied HLA types of the donor and recipients further supports the fact that the symptomless condition of the cohort members is due to a non-pathogenic strain of HIV-1 or a strain of low virulence.

Accordingly, this provides a screening procedure for subjects putatively infected by a non-pathogenic HIV-1 isolate where such subjects are seropositive for HIV-1 (eg have antibodies to an HIV-1 glycoprotein) yet have normal proliferative responses and cytokine production to mitogens, allogeneic mononuclear cells and to recall antigens.

EXAMPLE 11

Clinical Immunology of Cohort

To establish that the donor and the recipients belonging to the cohort exhibit normal immunological profiles, members of the cohort were assayed for CD4, CD8, lymphocyte count, CD4/CD8 ratio and β-2-microglobulin over time since seroconversion.

Parameters considered normal in non-infected individuals range as follows:

CD3                  55-82%     620-2200 (x10^6/L)
CD4                  29-58%     420-1410 (x10^6/L)
CD8                  12-43%     200-980 (x10^6/L)
Lymphocyte count                1000-3500 (x10^6/L)
CD4/CD8              0.7-3.7
β-2-microglobulin    0.00-2.20 mg/L
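
A comparison of measured values with the normal ranges above can be sketched as a simple lookup. The Python below is an illustration only. It covers the percentage, ratio and β-2-microglobulin rows, the function name and dictionary keys are assumptions, and the sample values are hypothetical, not cohort data.

```python
# Normal ranges from the table above (percentages, ratio and mg/L rows only).
NORMAL_RANGES = {
    "CD3 %": (55, 82),
    "CD4 %": (29, 58),
    "CD8 %": (12, 43),
    "CD4/CD8": (0.7, 3.7),
    "beta-2-microglobulin mg/L": (0.00, 2.20),
}

def out_of_range(results):
    """Return the names of measurements falling outside their normal range."""
    flagged = []
    for name, value in results.items():
        low, high = NORMAL_RANGES[name]
        if not low <= value <= high:
            flagged.append(name)
    return flagged

# Hypothetical values for illustration only; not data from the cohort.
example = {"CD3 %": 70, "CD4 %": 25, "CD8 %": 40, "CD4/CD8": 0.6}
flagged = out_of_range(example)
print(flagged)
```

A profile for which the returned list is empty would be described as within normal limits for the parameters checked.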





The results are shown in Figures 10(a)-(g) and clearly show that the immunological profiles of cohort members are substantially normal, further highlighting the non-pathogenicity of the HIV-1 isolates of the present invention. Figure 10(g) shows a graph of the Kaplan-Meier (Cox and Oakes, 1989) estimates of time to disease progression (AIDS or CD4 < 250). The results demonstrate that the difference is large in spite of the conservative bias, with a median time to progression of 6[...] years in the main database. An exact logrank test (Cytel Software Corporation, StatXact: Software for Exact Nonparametric Inference, Cambridge, Massachusetts) was performed, demonstrating that the difference between the groups was highly statistically
significant (logrank statistic U.8, pO.OOOl). 

EXAMPLE 12

Sequencing of isolate HIV-1 C18 MBC

The genome of variant HIV-1 designated C18 HIV-1 MBC was amplified by the polymerase chain reaction (PCR) as 7 overlapping fragments using outer oligonucleotide primers, designed using the programme PCRPLAN (IntelliGenetics), listed in Table 5 and either UlTma (Applied Biosystems) or a mixture of KlenTaq and Pfu (KlenTaq LA, Ab Peptides Inc) polymerases (for faithful amplification of long fragments). The resulting fragments were cloned into the SmaI site of the plasmid vector pGEM7Zf(-). Insert-containing clones representing each region of the full length variant HIV-1 were sequenced by a nested deletion strategy (Guo & Wu, 1982) and cycle sequencing with Taq polymerase and dye labelled primers complementary to the T7 or SP6 sites within the cloning vector. Nucleotide sequences were entered and collated by ASSEMGEL and SEQIN (IntelliGenetics) and SEQED (Applied Biosystems) and translated to the encoded amino acid sequences using TRANSL (IntelliGenetics) programmes. Sequence alignments used NALIGN, CLUSTAL (IntelliGenetics) and SEQED programmes.

The full length sequence (Fig 9; SEQ ID NO: [...]) of isolate HIV-1 C18 MBC is [...] nucleotides long which is 506 nucleotides shorter than the HIV-1 NL43 sequence. This size difference is comprised of 126 nucleotides of insertions and 632 nucleotides of deletions, see Table 6. The most extensive differences between the HIV-1 C18 MBC sequence and HIV-1 NL43 are in the U3 region of the LTR and in the nef gene, as hereinafter described.

The 5' LTR has deletions of 120 and 87 nucleotides and a region of low sequence homology, which is the result of an imperfect duplication of the downstream NFκB and Sp1 response sequences. These result in the loss of sequence from a number of sites important in the regulation of transcription of HIV-1 genes, including the negative response element (NRE) and the response elements for a number of transcription factors including NF-AT, NRT-1, USF and TCF-1α. Furthermore, the low homology region contains an extra NFκB and Sp1 sites as well as an insertion of 9 nucleotides between the usual NFκB sites. Downstream of the NFκB sites the sequence of the LTR has a high level of homology (96%) with the same region of HIV-1 NL43.

The gag gene contains 3 insertions, which represent direct repetitions of adjacent sequences. The first is a perfect repeat of 15 nucleotides after the equivalent of nucleotide 1134 of HIV-1 NL43 and adds 5 amino acids to the C-terminus region of p17gag. The remaining 2 insertions are imperfect and perfect repeats of 30 and 6 nucleotides, respectively, after the equivalent of HIV-1 NL43 nucleotides 2163 and 2232, respectively. These encode an extra 12 amino acids in the C-terminus region of p15gag just downstream of the gag to pol frameshift sequences. The variation in sequence length of the gag gene at these two positions is unusual. The homology of the encoded amino acid sequence of HIV-1 C18 MBC and HIV-1 NL43 for the gag p17, p24, and p15 proteins is 87.[...]%, 93.[...]% and 94.3%, respectively.
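
Whether an insertion or deletion adds or removes whole codons, or instead shifts the reading frame, depends only on whether its length is a multiple of three. The Python sketch below applies that rule to lengths quoted in the text; the function name is illustrative.

```python
def indel_effect(length_nt):
    """Classify an insertion or deletion by its effect on the reading frame.

    Returns ("in-frame", codons) when the length is a multiple of 3,
    otherwise ("frameshift", None).
    """
    if length_nt % 3 == 0:
        return ("in-frame", length_nt // 3)
    return ("frameshift", None)

# The three gag insertions (nt) described above.
for n in (15, 30, 6):
    print(n, indel_effect(n))

# The 16 nucleotide nef-alone deletion reported for C98 HIV.
print(16, indel_effect(16))
```

The 15 nucleotide repeat corresponds to 5 codons and the 30 and 6 nucleotide repeats together to 12 codons, matching the amino acid counts given for gag, whereas a 16 nucleotide deletion cannot preserve the reading frame.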

In the pol ORF, the encoded proteins have high homology with the HIV-1 NL43 sequences, being 95.5% overall comprising p10 protease 92.9%, p66 reverse transcriptase 95.4% and p34 integrase 95.8%. The amino acid sequence of the p66 RT lacks the mutations associated with resistance to the nucleoside (AZT, ddI, ddC) and non-nucleoside (Nevirapine) analogue drugs used in the treatment of HIV-1-infected persons.

The vif gene encodes a 192 amino acid protein with 88.0% homology with that of HIV-1 NL43. The vpr gene encodes a 96 amino acid protein with 89.6% homology with that of HIV-1 NL43.

There are 2 insertions and 1 deletion of sequences in the vpu gene. The insertions of 3 and 9 nucleotides are after the equivalent of nucleotide 6071 and 6234, respectively, of HIV-1 NL43. These add 1 amino acid after amino acid 5, and 3 amino acids after amino acid 59 of the encoded Vpu protein. The deletion of 12 nucleotides after the equivalent of HIV-1 NL43 nucleotide 6261 deletes 4 amino acids from the C-terminal region of Vpu as well as from the signal peptide of the env polyprotein, which is encoded by an overlapping reading frame. Amino acid sequence homology of HIV-1 C18 MBC Vpu with HIV-1 NL43 is 85.2%.

The sequence encoding the env gp120 has 9 insertions totalling [...] nucleotides (encoding 15 amino acids) and the deletion of a total of 18 nucleotides (encoding 6 amino acids). These are listed in Table 6. All of these events (insertions and deletions) are at positions [...] in the env gene. This is within the env V3 coding region, immediately upstream of the sequence encoding the so called V3 tip (or loop) amino acid sequence, Gly Pro Gly Arg. The V3 region sequence is that of a typical clade B subtype (North America, Europe and Australia) being identical to the clade B consensus sequence (based on 1[...]6 env sequences) at 29/35 positions. The type of amino acid at positions 11 and 25 of the V3 loop region (where position 1 is the Cys at amino acid 266 of the env gp120) is predictive of the virus non-syncytium/syncytium forming phenotype (Fouchier et al, 1992). The HIV-1 C18 MBC env gp120 encoded amino acid sequence has Ser at position 11 and Ile at position 25 of the V3 loop region. The lack of a positively charged amino acid at both positions is strongly indicative of a non-syncytium viral phenotype. The overall amino acid sequence homology with HIV-1 NL43 (ignoring deletions and insertions) is 96.1%, comprising 85.5% for the gp120 region and 87.6% for the gp41 region.
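
The phenotype rule described above, in which a positively charged residue at either of the two named V3 positions points to a syncytium-inducing (SI) virus and its absence at both points to a non-syncytium-inducing (NSI) virus, can be written as a short Python sketch. This is a simplified illustration of the rule as stated in the text, not the published prediction method in full. The function name is an assumption, and only arginine and lysine are treated as positively charged.

```python
POSITIVE = {"R", "K"}  # arginine and lysine; histidine is ignored in this sketch

def predict_phenotype(first_residue, second_residue):
    """Return "SI" if either V3 position carries a positive charge, else "NSI".

    Arguments are one-letter amino acid codes for the two predictive positions.
    """
    if first_residue in POSITIVE or second_residue in POSITIVE:
        return "SI"
    return "NSI"

c18 = predict_phenotype("S", "I")          # Ser and Ile, as reported for the C18 sequence
hypothetical = predict_phenotype("R", "I") # hypothetical variant with Arg at the first position
print(c18, hypothetical)
```

For the Ser and Ile residues reported here the sketch returns NSI, in line with the phenotype stated in the text.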

Both the tat and rev second exon open reading frames (ORF) are longer than in HIV-1 NL43. A change of the termination codon from TAG to TCG extends the tat ORF to a downstream in-phase termination codon, extending the encoded tat amino acid sequence by 15 residues, compared with the 86 amino acid long HIV-1 NL43 tat protein, to a total length of 101 amino acids. However, this is the usual length of the HIV-1 tat protein.

Similarly, the normal rev termination codon is changed from TAG to GAG. This extends the rev ORF to an in-phase termination codon 3 codons downstream so that the encoded Rev protein is 119 amino acids long instead of the usual 116.

As mentioned above the most extensive differences between the sequences of the isolate HIV-1 C18 MBC and HIV-1 NL43 are in the nef gene and the LTR. As the nef gene overlaps the 3' LTR, these differences are found in both the nef-alone and the nef/LTR overlap regions. The HIV C18 MBC-encoded nef protein is only 24 amino acids long compared with the normal length of 206 amino acids. This severe shortening of the nef protein is due to the deletion of 188 nucleotides (the 177 and 11 nucleotide deletions) from the nef-alone region which also brings into phase a termination codon, TAG, at the resulting 25th codon. Downstream there is further loss of potential nef gene sequences by the 120 and 87 nucleotide deletions situated in the nef/LTR overlap region. The resulting 24 amino acid nef protein is identical to the N-terminus of the HIV-1 NL43 nef at 9 of the first 10 positions. Thereafter, homology is lost completely.

Some sequences used in the generation of mature mRNAs are altered or lost in C18 MBC. The dinucleotide immediately after the splice donor site 2 (SD2) at nts 4818-4819 (HIV-1 NL43 equivalent nts 4963-4964) is changed from the conserved GT to GC. It is expected that this change would lead to loss of function of this site as a splice donor. Splice donor 2 is used in the processing of HIV-1 transcripts to some of the mRNAs that encode Tat, Rev and Nef proteins. Similarly the splice acceptor site 7 (SA7) sequence at nts 6477-6478 (HIV-1 NL43 equivalent nts 6602-6603) is changed from the conserved AG dinucleotide to TC. This change is expected to lead to loss of function of this site as a splice acceptor. While this SA site is used in HIV-1 mRNA processing it is not a major site and is not used in the production of the regulatory protein (Tat, Rev and Nef) mRNAs. The splice acceptor 12 site is absent from the C18 MBC sequence (NL43 equivalent nts 9161-9162) as it is within the first deletion region in the nef/LTR overlap region which occurs at nt 8797 and results in the loss of NL43 nucleotides 9105 to 9224. It is significant that the SA12 site is absent from the sequence of all of the cohort virus isolates so far obtained as well as from the sequence of D36 PBMC; however, the C54 PBMC sequence does contain the SA12 site. SA12 is not used in the processing of mRNAs that encode the viral regulatory proteins. Normally SA12 is used in splicing in conjunction with SD1, 2, 3 and 4 and the resulting spliced RNA is probably not an mRNA but may have a regulatory role involving binding to cellular proteins (Smith et al, 1992).
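
The reasoning about splice site function rests on the conserved dinucleotides at intron boundaries: GT immediately after a splice donor and AG immediately before a splice acceptor. A minimal Python sketch of that check follows. The function and dictionary names are illustrative, and the check considers only the dinucleotide, not the wider splice site context.

```python
CANONICAL = {"donor": "GT", "acceptor": "AG"}

def splice_site_intact(kind, dinucleotide):
    """True if the dinucleotide matches the conserved one for this site type."""
    return dinucleotide.upper() == CANONICAL[kind]

sd2_wild_type = splice_site_intact("donor", "GT")   # conserved SD2 dinucleotide
sd2_c18 = splice_site_intact("donor", "GC")         # SD2 as described for the C18 sequence
sa7_c18 = splice_site_intact("acceptor", "TC")      # SA7 as described for the C18 sequence
print(sd2_wild_type, sd2_c18, sa7_c18)
```

Both altered sites fail the check, which is consistent with the expected loss of function described above.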

An interesting feature of the sequence of the HIV-1 C18 MBC isolate is the deletion and rearrangement of sequence from the 5'-LTR U3 region and the deletion of sequence from the nef gene (both nef-alone and nef/3' LTR regions). These are the only features of the sequence distinct from disease-causing HIV-1. The lack of AIDS or AIDS-like symptoms in the patient C18 is attributed to the effects of the loss of LTR sequence and/or the loss of nef coding sequences and their role in the pathogenesis of AIDS.



TABLE 4 

Primers used to Amplify Overlapping regions of HIV-1 C18 MBC
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ct ac 
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CL cm 


Mn 
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so 




a. at 


J2S1 




30 




CL ai 


»» 




17 


TOOOOCMMQKniUMCMlCfCXOCftfiAXlilOC 


GL 1» 


I44i 




10 




CL a4 


•4aa 




at 


ooAitimcicwiciCTCTCicciiocr 



Underlined sequences depict added restriction enzyme site.

+ and - orientations refer to sense and antisense strands of the double stranded DNA sequence.

TABLE 5

Sequence Deletions and Insertions in HIV-1 C18 MBC
Compared with HIV-1 NL43

Gene or region    Position (nt)         Deletions    Insertions
                  C18 MBC     NL43      (nt)         (nt)







29 


20 
mm 








S'-LTK U3 


iS 




fl7 






S'-LTtt TO 


154 


360 




A 

iM 




pi 7 


939 


1134 






15 


MO' d15 


19S2 












2001 






o 




vptt 








J 




%pti/«nv 


(092 






y 




vpu/ofiv 


<12l 








20 


«nv 


C402 


6620 


m 


4 




•ov 


6514 


6653 


2 






onv 


6524 


6665 


1 






Mir 


0630 


6772 




9 


25 


amr 


4640 


6770 




3 


«nr 


7011 


7141 


6 






etair 


7140 


7276 


3 






•nv 


7195 


7334 




6 




amr 


7266 


7399 


3 






anv 


7279 


7414 




6 


30 


aov 


7390 


7420 




2 




amr 


7200 


7429 




1 






7214 


7441 


3 






onv 


7463 


7593 




3 




onv 


7471 


7590 




9 


3S 


aaf 


8711 


0029 


177 






aaf 


0723 


9018 


11 






nar / LTR 


0790 


9104 


130 






»#£ / I*TA 


0054 


9280 


07 




40 


LTR U3 


0923 


9435 












m 


.121 
EXAMPLE 13
Macrophage Isolates of HIV-1 C18 and HIV-1 C98

HIV-1 has been isolated from the macrophages of patients C18 and C98.

Patient monocytes were prepared as follows. Whole blood was spun at 2000 rpm for 10
minutes. Plasma was removed into a separate tube and the remaining cells were diluted
1:2 in PBS* (magnesium and calcium free phosphate buffered saline). This was
underlaid with 10 ml of Ficoll Isopaque and spun at 2000 rpm for 20 minutes. Cells
were collected from the interphase and washed three times with PBS*. These cells were
then seeded into a 6 well Costar tray at a concentration of 1.0 x 10[illegible]/ml and
allowed to adhere for 1 hour. Any non-adherent cells were removed by aspiration.

Donor HIV-1 negative macrophages for use in co-cultivation were [...]

Peripheral blood mononuclear cells were purified from whole blood using a
Ficoll/Isopaque density gradient. These cells were seeded at a concentration of 2.0 x
10[illegible]/ml in teflon [...]. PBMC were cultured in the presence of 3 µg/ml of PHA
and 1000 U/ml of M-CSF 3 days prior to co-culture.

On the day of co-culture, donor PBMC were CD8 depleted. Dyna beads coated with
anti-CD8 were used for this purpose. Dyna beads were washed once in PBS* and then
applied to a magnet for 3 minutes. Supernatant was removed and the beads were then
resuspended in 250 µl of RF-10. Aliquots of 2.0 x 10[illegible] patient cells were then
added to 250 µl (3 beads : 1 CD8 T-cell) of Dyna beads and allowed to incubate for 30
minutes on ice with occasional mixing. After 30 minutes the cell suspension was placed
onto a magnet for 3 minutes. The supernatant was then removed and placed into a second
tube containing 142 µl (1 bead : 1 CD8 T-cell) of Dyna beads. This suspension was placed
on ice for an additional 30 minutes with occasional mixing. After 30 minutes the cell
suspension was placed onto a magnet for 3 minutes. Supernatant was removed and
washed once in RF-10.
For co-culture, CD8 depleted PBMC were then added to patient [...]. Half media
changes were done every 7 days for a period of 21 days. Aliquots of 2.5 ml of medium
were removed from these cultures and replaced with CD8 depleted donor PBMC in
Iscoves containing 10% HuS (Human serum), 5% v/v FCS and 5% w/v IL-2 and
1000 U/ml of M-CSF. Harvested supernatants were spun at 1400 rpm for 10 minutes
and stored as 1 ml aliquots. Cell pellets were lysed in 200 µl of lysis buffer for PCR
analysis. Infection was quantitated using a p24 EIA Kit.

Cells were harvested from the co-cultures and used to prepare DNA as described above.
The nef/3'-LTR region of both virus isolates was amplified by PCR using the above
described primer sets and conditions (Example 12). The resulting amplimers were
cloned into the plasmid vector pT7T3U19 and the nucleotide sequence determined by
the Taq cycle sequencing method with dye-labelled primers.

The C18 macrophage sequence has 3 deletions starting and finishing at positions within
3 nucleotides of the same deletions in C18MBC. The encoded Nef is 3 amino
acids long compared with 7 amino acids for C18MBC. The low homology region of the
LTR U3 region of C18 macrophage is very similar in sequence to C18MBC;
similarly it has one extra upstream NFκB site.

On the other hand, the sequence of C98 macrophage has a number of differences from
the C98 isolate. While it has exactly the same first deletion of 16 nucleotides just
upstream of the polypurine tract (PPT), in the nef-alone region, and exactly the same
second deletion (position and size), it has an extra deletion of 18 nucleotides at HIV-1
NL4-3 equivalent nucleotides 9206 to 9223. The final deletion is in approximately the
same position as in the C98 isolate but is 5 nucleotides longer. The encoded Nef protein
is 34 amino acids long compared with 86 amino acids for the C98 isolate. The low
homology region is very similar to the C98 isolate, having the same 2 extra upstream
NFκB sites and completely lacking the normal 5'-NFκB site.



EXAMPLE 14

Construction and Use of an Infectious Molecular Clone

Molecular biological techniques can be used to construct a molecular clone of, for
example, HIV-1 C18MBC. Two schemes may be used. In the first scheme genomic
DNA, extracted from either the CD4 positive PBMC of the patient C18 or donor PBMC
that have been infected with the isolate HIV-1 C18MBC, is used as the template for
polymerase chain reaction amplification, using thermostable polymerase of high
transcriptional fidelity (eg UlTma polymerase or KlenTaq/Pfu polymerase mixture), of
long (6 to 7 kb) overlapping fragments representing the 5'- and 3'-parts of the HIV-1
C18MBC proviral genome of total length 9207 nts. The amplified fragments may then
be ligated together after digestion with a restriction enzyme that cleaves [...]
common to the overlapping region of the amplified fragments, for example the unique
Bgl I or Nco I sites. Ligation of this full length proviral DNA into a plasmid vector
will allow its propagation in E. coli and the subsequent preparation of large (mg)
quantities of this molecularly cloned proviral DNA.

In the second scheme donor PBMC that have been infected with the isolate HIV-1
C18MBC are used as a source of non-integrated proviral DNA which can be extracted
from the infected cells by the Hirt extraction method (Hirt, 1967). Circular proviral
DNA molecules may be linearised by digestion with a restriction enzyme that cleaves
at a unique position in the genome (eg the Bgl I or Nco I sites). The resulting linearised
molecules can be ligated into a plasmid or, more usually, a bacteriophage lambda (λ)
based vector (eg Charon 4a, λWES) after modification of the ends to provide blunt or
cohesive ends compatible with the vector. Transformation or transduction of E. coli with
the recombinant plasmid or bacteriophage material, respectively, allows the propagation
of the proviral DNA. Clones of E. coli containing proviral DNA may be selected and
DNA prepared. Molecular clones of retroviral genomes prepared in this way are
permuted. Rearrangement to the functional arrangement of sequences is achieved by
restriction enzyme cleavage and religation of fragments to reconstruct the correctly
permuted proviral genome.
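
The notion of a permuted clone can be illustrated with a minimal sketch. It assumes a
toy circular genome represented as an ordered list of region labels; the labels, the
position of the unique site and the function names are hypothetical and are not taken
from this specification.

```python
def linearise(circle, cut):
    """Open a circular molecule at index `cut`; the linear product starts there."""
    return circle[cut:] + circle[:cut]


def repermute(linear, start_label):
    """Rotate a permuted linear clone so that it begins at `start_label`."""
    i = linear.index(start_label)
    return linear[i:] + linear[:i]


# Hypothetical circular form; the unique site is assumed to lie inside pol,
# splitting it into a 5' part and a 3' part.
circle = ["LTR", "gag", "pol-5'", "pol-3'", "env", "nef"]

# Cutting at the unique site gives a clone that starts in the middle of pol.
clone = linearise(circle, circle.index("pol-3'"))
print(clone)

# Cleavage and religation is modelled here as a rotation back to the LTR.
restored = repermute(clone, "LTR")
print(restored)
```

The sketch treats religation as a simple rotation of the fragment order. Real circular
forms may carry one or two LTRs, which this toy model ignores.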
The molecularly cloned DNA products of both schemes can be used to prepare variant
proviral genomes that may be used as the basis of a biologically attenuated HIV-1
vaccine strain. Similarly, they may be modified to contain extra DNA sequences in the
nef-alone deletion region that deliver sequences that may be of therapeutic
advantage (eg antisense or ribozyme sequences).

Infectious virus particles of HIV-1 C18MBC, or modified virus, can be produced by
transfection of human cells (eg HeLa cells) which will produce, and release to the
culture medium, virus particles of HIV-1 C18MBC or modified virus. These virus
particles can be used to infect a variety of CD4 positive cells for further propagation or
experimentation.

EXAMPLE 15
In vivo Primate Model

Following construction of infectious molecular clones of the mutant HIV-1 strains,
studies are then undertaken in primates to establish attenuation, immunogenicity and
vaccine prophylactic efficacy. All studies compare mutant clones of HIV-1 with
isogenic wild-type (WT) virus. Initial studies are performed using the macaque
(M. nemestrina) model of HIV-1 infection. Macaque-infectious WT HIV-1 and mutant
clones are compared with respect to duration of viremia, anatomic sites of replication,
and cellular and humoral immune responses. Where the mutant HIV-1 clones induce
an immune response in the macaques infected, challenge studies with WT virus are also
performed. Studies are performed in a limited number of chimpanzees, generally in
parallel with the macaque studies. Relevant mutations are engineered into WT HIV-1
clones previously shown to produce chronic infection in chimps and the course of chimp
infection with mutant clones compared with historical controls. If infection is
established, WT challenge studies are also performed.

Examples 16 to 21 relate to the screening of antibodies in HIV-1 infected individuals
to peptides covering regions of Nef.
EXAMPLE 16

Study Subjects

Serum samples were obtained from seven HIV+ve individuals, D-36, C-124, C-98, C-64,
C-18, C-49 and C-54. Individuals C-124, C-98, C-64, C-18, C-49 and C-54 (recipients)
were infected through units of blood or blood product from donor D-36, over a 2-year
period. Long-term follow-up of the six recipients and the donor shows persistent long
term asymptomatic infection. This group is referred to herein as the long term
non-progressor 1 (LTNP1) cohort. Members of this cohort have been infected for an
average of 11 years (10.75 to 14 years) and it was established that the donor had been
infected since April, 1981 (Learmont et al, 1992). Regular follow-up includes history,
physical examination, full blood count, T-cell subset counting and measurement of serum
p24 antigen and β2-microglobulin concentrations (Learmont et al, 1992; Learmont et al,
1995). As controls, sera were obtained from 14 HIV-1 negative individuals (HIV-1-ve),
4 patients who were infected with HIV-1 through sexual activity or through blood
transfusion and are also considered long-term non-progressors and 12 HIV-1 positive
(HIV-1+ve) individuals that have developed clinical infections (long term progressors,
LTP). Sera from patients with autoimmune disease and who were HIV-1-ve were also
employed.

EXAMPLE 17

Peptides corresponding to amino acid residues 1 to 19, 20 to 36, 44 to 65, 72 to 83, 89
to 97, 109 to 114, 121 to 136, 162 to 177, 164 to 186 and 187 to 206 of HIV-1 nef
(HIV-1 NL4-3) were synthesized using standard t-Boc chemistry and purified by high
pressure chromatography as described elsewhere (Fecondo et al, 1993).

EXAMPLE 18

Expression of recombinant HIV-1 Nef protein in E. coli
The large scale expression of the 27 kDa form of HIV-1 NL4-3 Nef in E. coli and
subsequent purification were as described by Azad et al (1994).
EXAMPLE 19

Screening of sera from HIV-1+ve individuals and control HIV-1-ve groups
for reactivity against Nef protein and derived peptides by direct EIA
For detecting antibodies that recognise the Nef protein or its peptide derivatives, highly
purified full length Nef protein or peptides corresponding to the Nef amino
acid sequence were coated onto the wells of 96-well polystyrene microtitre [...]
ng/well or 500 ng/well in PBS, respectively. Peptides and proteins were allowed to coat
for 2 h at 37°C. After this incubation period the wells were washed three times with
PBS containing 0.05% v/v Tween 20 (PBS-Tween) and any remaining available sites
on the wells blocked by incubation of 150 µl of gelatine (1% w/v) in PBS for 1 h at
37°C. Following washing with PBS-Tween as described above, 50 µl of serum diluted
in PBS/BSA (1% w/v) was added to the wells and incubated for 1.5 h at 37°C. Sera
added to the wells included that from the seven cohort members (D36, C98, C18, C54,
C49, C64 and C124 [see Learmont et al, 1992]) and the control groups described above.
The wells were again washed with PBS-Tween and subsequently incubated with 50 µl
of biotinylated sheep anti-human Ig (diluted 1:1000 in 1% w/v BSA/PBS; Amersham)
for 1 h at 37°C. Following further washing 50 µl of Streptavidin-HRP (diluted 1:1000
in 1% w/v BSA/PBS; Dakopatts) was added to the wells and the plate incubated [...]
for 30 min. An aliquot of 100 µl of substrate (O-phenylenediamine, Sigma) was finally
added after washing and the plate allowed to incubate at room temperature for 15 min.
The reaction was stopped by the addition of 1 N H2SO4 and the plate read at 450/630
nm using a Titertek plate reader.

EXAMPLE 20

Recognition of full length recombinant Nef protein by patient sera

The prevalence of a Nef-specific antibody response in the cohort members (referred to
herein as LTNP1), long term progressors (LTP) HIV-1+ve individuals and another
group of long term non-progressors (LTNP2) patients who were infected by different
donors was assessed by EIA. Sera obtained from 14 normal individuals (HIV-1-ve) and
14 individuals with autoimmune disease (A/HIV-1-ve) were used as controls.

All individuals who were classified as LTP patients demonstrated high levels of
antibodies that recognised full length Nef protein (Figure 12a). Sera obtained from
HIV-1-ve or the A/HIV-1-ve groups showed only low level recognition, which was
considered at background levels, towards Nef protein (Figure 12b(i)-(iii)). In contrast
to normal individuals, sera obtained from the LTNP1 cohort showed high recognition
of Nef (Figure 12c), indicating the presence of significant levels of Nef antibodies in the
sera of these individuals. The sera titered out to approximately 1:3000. Similar levels
of Nef antibodies were observed in the LTP group (Figure 12a). Nef-positive antibodies
were also detected in the LTNP2 group and again titered at 1:3000 dilution (Figure 12d).
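
How a statement such as "titered out to approximately 1:3000" can be derived from a
dilution series is sketched below. This is an illustration only: the optical density
values are hypothetical, and the cut-off rule (mean of the negative controls plus three
standard deviations) is a common convention assumed here, not a rule stated in this
specification.

```python
from statistics import mean, stdev


def endpoint_titre(readings, cutoff):
    """Return the largest reciprocal dilution whose OD exceeds cutoff, else None."""
    positive = [dilution for dilution, od in readings.items() if od > cutoff]
    return max(positive) if positive else None


# Hypothetical OD readings from HIV-1-ve control sera (background).
negative_controls = [0.05, 0.07, 0.06, 0.04]

# Assumed cut-off convention: background mean plus three standard deviations.
cutoff = mean(negative_controls) + 3 * stdev(negative_controls)

# Hypothetical serum dilution series: reciprocal dilution -> OD reading.
serum = {100: 1.8, 300: 1.2, 1000: 0.6, 3000: 0.2, 10000: 0.06}

titre = endpoint_titre(serum, cutoff)
print(f"cut-off OD = {cutoff:.3f}, endpoint titre = 1:{titre}")
```

With these invented readings the last dilution above background is 1:3000. Other
cut-off conventions would work equally well in the same function.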

EXAMPLE 21
Recognition of Nef-derived peptides by sera
Recognition of synthetic peptides, which correspond to amino acid sequences of Nef, by
the LTNP1 cohort was assessed. Various peptides were assessed to detect those
antigenic epitopes of Nef protein recognised by these individuals. Peptides
corresponding to amino acid sequences 1 to 19; 20 to 36; 44 to 65; 72 to 83; 89 to 97;
109 to 114; 164 to 186; 187 to 206; 121 to 135 and 162 to 177 of HIV-1 NL4-3 Nef
protein were used to screen sera for the presence of specific antibodies. All sera from
the LTP group recognised all Nef-derived peptides tested (Figure 13a(i)-(x)). Sera
titered between 1:1000 and 1:10,000. Sera from patients with autoimmune disease
displayed only low background non-specific recognition. Normal sera from HIV-1-ve
individuals tested to date also displayed only background activity (Figure 13b). Sera
from the LTNP1 and LTNP2 groups also showed significant reactivity against Nef
peptides corresponding to Nef amino acid sequences 1-19, 20-36, 44-65, 72-83, 89-97,
109-114, 121-135, 164-186 and 187-206 (Figure 13c(i)-(x) and d(i)-(x)). Sera from the
LTNP2 group also showed significant reactivity against Nef peptide 162-177 (Figure
13d(i)-(x)), similar to that shown by the LTP group, indicating that this [...]
was immunogenic. However, sera from the LTNP1 cohort showed no significant
reactivity towards peptide 162-177 above background levels obtained with normal
HIV-1-ve sera (Figure 13c(i)-(x)), indicating that this group of individuals were exposed
to cells expressing a Nef protein which did not contain this region. While sera from the
LTNP1 cohort did not react with the peptide corresponding to amino acid residues 162-
177 of Nef, the sera from all patients did recognise a longer peptide, 164 to 186, which
encompassed most of Nef 162-177. This clearly indicates that the sera recognised
antigenic epitopes between 177-186.
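
The reasoning above can be checked with a small sketch that treats each peptide as a
set of residue numbers. It is a simplification under stated assumptions: linear
epitopes are assumed to lie wholly within a peptide, and the helper name `residues`
is hypothetical.

```python
def residues(start, end):
    """Return the set of residue numbers covered by a peptide, inclusive."""
    return set(range(start, end + 1))


# Peptide not recognised by LTNP1 sera, and the longer peptide that was recognised.
not_recognised = residues(162, 177)
recognised = residues(164, 186)

# Residues present only in the recognised peptide are the candidate epitope region.
candidate = sorted(recognised - not_recognised)
print(f"candidate epitope residues: {candidate[0]} to {candidate[-1]}")

# Residues shared by both peptides cannot by themselves explain the reactivity.
shared = sorted(recognised & not_recognised)
print(f"shared residues: {shared[0]} to {shared[-1]}")
```

Under these assumptions the residues unique to the longer peptide are 178 to 186,
which lies within the range 177-186 given in the text. An epitope that straddles
residue 177 would not be captured by this simple set difference.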

These results clearly indicate that all individuals from the LTNP1 cohort were exposed
at some time to HIV-1 infected cells expressing a Nef protein that only had a small
deletion encompassing amino acids 162 to 177.

The antibody testing has identified an antigenic region in the Nef protein which if
deleted gives rise to attenuated HIV-1 viral strains. Hence, testing of the HIV-1 positive
population may identify further examples of individuals infected with attenuated viral
quasispecies. Additionally, lack of recognition of this antigenic epitope offers an
antibody assay for testing animals experimentally infected with an HIV-1 nef-attenuated
viral strain, in particular deleted in the region covering Nef amino acids 162 to 177
(relative to HIV-1 NL4-3 Nef).

Those skilled in the art will appreciate that the invention described herein is susceptible
to variations and modifications other than those specifically described. It is to be
understood that the invention includes all such variations and modifications. The
invention also includes all of the steps, features, compositions and compounds referred
to or indicated in this specification, individually or collectively, and any and all
combinations of any two or more of said steps or features.
SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANTS: Nicholas J. Deacon 

Jennifer C. Learmont 
Dale A. McPhee 
Suzanne Crowe 
David Cooper 

(ii) TITLE OF INVENTION: NON-PATHOGENIC STRAINS OF HIV-1
(iii) NUMBER OF SEQUENCES: 841 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SCULLY, SCOTT, MURPHY & PRESSER 

(B) STREET: 400 GARDEN CITY PLAZA 

(C) CITY: GARDEN CITY 

(D) STATE: NEW YORK 

(E) COUNTRY: U.S.A. 

(F) ZIP: 11530-0299 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/477,464 

(B) FILING DATE: 07-JUN-1995 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PM3864 (AU) 

(B) FILING DATE: 14-FEB-1994 

(A) APPLICATION NUMBER: PM4002 (AU) 

(B) FILING DATE: 21-FEB-1994

(A) APPLICATION NUMBER: PN0284 (AU)

(B) FILING DATE: 23-DEC-1994

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: FRANK S. DIGIGLIO 

(C) REFERENCE/DOCKET NUMBER: 9606Z-I 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (516) 742-4343

(B) TELEFAX: (516) 742-4366 



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:



TGGAAGGGCT 


AATTTGGTCC 


CAAAAAAGAC 


AAGAGATCCT 


TGATCTGTGG 


ATCTACCACA 


C A 

60 


CACAAGGCTA 


CTTCCCTGAT 


TGGCAGAACT 


ACACACCAGG 


GCCAGGGATC 


AGATATCCAC 


120 


TGACCTTTGG 


ATGGTGCTTC 


AAGTTAGTAC 


CAGTTGAACC 


AGAGCAAGTA 


GAAGAGGCCA 


180 


AATAAGGAGA 


GAAGAACAGC 


TTGTTACACC 


CTATGAGCCA 


GCATGGGATG 


GAGGACCCGG 


240 


AGGGAGAAGT 


ATTAGTGTGG 


AAGTTTGACA 


GCCTCCTAGC 


ATTTCGTCAC 


ATGGCCCGAG 


300 


AGCTGCATCC 


GGAGTACTAC 


AAAGACTGCT 


GACATCGAGC 


TTTCTACAAG 


GGACTTTCCG 


360 


CTGGGGACTT 


TCCAGGGAGG 


TGTGGCCTGG 


GCGGGACTGG 


GGAGTGGCGA 


GCCCTCAGAT 


420 


GCTACATATA 


AGCAGCTGCT 


TTTTGCCTGT 


ACTGGGTCTC 


TCTGGTTAGA 


CCAGATCTGA 


480 


GCCTGGGAGC 


TCTCTGGCTA 


ACTAGGGAAC 


CCACTGCTTA 


AGCCTCAATA 


AAGCTTGCCT 


540 


TGAGTGCTCA 


AAGTAGTGTG 


TGCCCGTCTG 


TTGTGTGACT 


CTGGTAACTA 


GAGATCCCTC 


600 


AGACCCTTTT 


AGTCAGTGTG 


GAAAATCTCT 


AGCAGTGGCG 


CCCGAACAGG 


GACTTGAAAG 


660 


CGAAAGTAAA 


GCCAGAGGAG 


ATCTCTCGAC 


GCAGGACTCG 


GCTTGCTGAA 


GCGCGCACGG 


720 


CAAGAGGCGA 


GGGGCGGCGA 


CTGGTGAGTA 


CGCCAAAAAT 


TTTGACTAGC GGAGGCTAGA 


780 


AGGAGAGAGA 


TGGGTGCGAG 


AGCGTCGGTA 


TTAAGCGGGG 


GAGAATTAGA 


TAAATGGGAA 


840 


AAAATTCGGT 


TAAGGCCAGG 


GGGAAAGAAA 


CAATATAAAC 


TAAAACATAT 


AGTATGGGCA 


900 


AGCAGGGAGC 


TAGAACGATT 


CGCAGTTAAT 


CCTGGCCTTT 


TAGAGACATC 


AGAAGGCTGT 


960 


AGACAAATAC 


TGGGACAGCT 


ACAACCATCC 


CTTCAGACAG 


GATCAGAAGA 


ACTTAGATCA 


1020 


TTATATAATA 


CAATAGCAGT 


CCTCTATTGT 


GTGCATCAAA 


GGATAGATGT 


AAAAGACACC 


1080 


AAGGAAGCCT 


TAGATAAGAT 


AGAGGAAGAG 


CAAAACAAAA 


GTAAGAAAAA 


GGCACAGCAA 


1140 


GCAGCAGCTG 


ACACAGGAAA 


CAACAGCCAG 


GTCAGCCAAA 


ATTACCCTAT 


AGTGCAGAAC 


1200 


CTCCAGGGGC 


AAATGGTACA 


TCAGGCCATA 


TCACCTAGAA 


CTTTAAATGC 


ATGGGTAAAA 


1260 


GTAGTAGAAG 


AGAAGGCTTT 


CAGCCCAGAA 


GTAATACCCA 


TGTTTTCAGC 


ATTATCAGAA 


1320 


GGAGCCACCC 


CACAAGATTT 


AAATACCATG 


CTAAACACAG 


TGGGGGGACA 


TCAAGCAGCC 


1380 


ATGCAAATGT 


TAAAAGAGAC 


CATCAATGAG 


GAAGCTGCAG 


AATGGGATAG 


ATTGCATCCA 


1440 


GTGCATGCAG 


GGCCTATTGC 


ACCAGGCCAG 


ATGAGAGAAC 


CAAGGGGAAG 


TGACATAGCA 


1500 


GGAACTACTA 


GTACCCTTCA 


GGAACAAATA 


GGATGGATGA 


CACATAATCC 


ACCTATCCCA 


1560 


GTAGGAGAAA 


TCTATAAAAG 


ATGGATAATC 


CTGGGATTAA 


ATAAAATAGT 


AAGAATGTAT 


1620 


AGCCCTACCA 


GCATTCTGGA 


CATAAGACAA 


GGACCAAAGG 


AACCCTTTAG 


AGACTATGTA 


1680 


GACCGATTCT 


ATAAAACTCT 


AAGAGCCGAG 


CAAGCTTCAC 


AAGAGGTAAA 


AAATTGGATG 


1740 
ACAGAAACCT


TGTTGGTCCA 


AAATGCGAAC 


CCAGATTGTA 


AGACTATTTT 


AAAAGCATTG 


1800 


GGACCAGGAG 


CGACACTAGA 


AGAAATGATG 


ACAGCATGTC 


AGGGAGTGGG 


GGGACCCGGC 


1860 


CATAAAGCAA 


GAGTTTTGGC 


TGAAGCAATG 


AGCCAAGTAA 


CAAATCCAGC 


TACCATAATG 


1920 


ATACAGAAAG 


GCAATTTTAG 


GAACCAAAGA 


AAGACTGTTA 


AGTGTTTCAA 


TTGTGGCAAA 


1980 


GAAGGGCACA 


TAGCCAAAAA 


TTGCAGGGCC 


CCTAGGAAAA 


AGGGCTGTTG 


GAAATGTGGA 


2040 


AAGGAAGGAC 


ACCAAATGAA 


AGATTGTACT 


GAGAGACAGG 


CTAATTTTTT 


AGGGAAGATC 


2100 


TGGCCTTCCC 


ACAAGGGAAG 


GCCAGGGAAT 


TTTCTTCAGA 


GCAGACCAGA 


GCCAACAGCC 


2160 


CCACCAGAAG 


AGAGCTTCAG 


GTTTGGGGAA 


GAGACAACAA 


CTCCCTCTCA 


GAAGCAGGAG 


2220 


CCGATAGACA 


AGGAACTGTA 


TCCTTTAGCT 


TCCCTCAGAT 


CACTCTTTGG 


CAGCGACCCC 


2280 


TCGTCACAAT 


AAAGATAGGG 


GGGCAATTAA 


AGGAAGCTCT 


ATTAGATACA 


GGAGCAGATG 


2340 


ATACAGTATT 


AGAAGAAATG 


AATTTGCCAG 


GAAGATGGAA 


ACCAAAAATG 


ATAGGGGGAA 


2400 


TTGGAGGTTT 


TATCAAAGTA 


GGACAGTATG 


ATCAGATACT 


CATAGAAATC 


TGCGGACATA 


2460 


AAGCTATAGG 


TACAGTATTA 


GTAGGACCTA 


CACCTGTCAA 


CATAATTGGA 


AGAAATCTGT 


2520 


TGACTCAGAT 


TGGCTGCACT 


TTAAATTTTC 


CCATTAGTCC 


TATTGAGACT 


GTACCAGTAA 


2580 


AATTAAAGCC 


AGGAATGGAT 


GGCCCAAAAG 


TTAAACAATG 


GCCATTGACA 


GAAGAAAAAA 


2640 


TAAAAGCATT 


AGTAGAAATT 


TGTACAGAAA 


TGGAAAAGGA 


AGGAAAAATT 


TCAAAAATTG 


2700 


GGCCTGAAAA 


TCCATACAAT 


ACTCCAGTAT 


TTGCCATAAA 


GAAAAAAGAC 


AGTACTAAAT 


2760 


GGAGAAAATT 


AGTAGATTTC 


AGAGAACTTA 


ATAAGAGAAC 


TCAAGATTTC 


TGGGAAGTTC 


2820 


AATTAGGAAT 


ACCACATCCT 


GCAGGGTTAA 


AACAGAAAAA 


ATCAGTAACA 


GTACTGGATG 


2880 


TGGGCGATGC 


ATATTTTTCA 


GTTCCCTTAG 


ATAAAGACTT 


CAGGAAGTAT 


ACTGCATTTA 


2940 


CCATACCTAG 


TATAAACAAT 


GAGACACCAG 


GGATTAGATA 


TCAGTACAAT 


GTGCTTCCAC 


3000 


AGGGATGGAA 


AGGATCACCA 


GCAATATTCC 


AGTGTAGCAT 


GACAAAAATC 


TTAGAGCCTT 


3060 


TTAGAAAACA 


AAATCCAGAC 


ATAGTCATCT 


ATCAATACAT 


GGATGATTTG 


TATGTAGGAT 


3120 


CTGACTTAGA 


AATAGGGCAG 


CATAGAACAA 


AAATAGAGGA 


ACTGAGACAA 


CATCTGTTGA 


3180 


GGTGGGGATT 


TACCACACCA 


GACAAAAAAC 


ATCAGAAAGA 


ACCTCCATTC 


CTTTGGATGG 


3240 


GTTATGAACT 


CCATCCTGAT 


AAATGGACAG 


TACAGCCTAT 


AGTGCTGCCA 


GAAAAGGACA 


3300 


GCTGGACTGT 


CAATGACATA 


CAGAAATTAG 


TGGGAAAATT 


GAATTGGGCA 


AGTCAGATTT 


3360 


ATGCAGGGAT 


TAAAGTAAGG 


CAATTATGTA 


AACTTCTTAG 


GGGAACCAAA 


GCACTAACAG 


3420 


AAGTAGTACC 


ACTAACAGAA 


GAAGCAGAGC 


TAGAACTGGC 


AGAAAACAGG 


GAGATTCTAA 


3480 


AAGAACCGGT 


ACATGGAGTG 


TATTATGACC 


CATCAAAAGA 


CTTAATAGCA 


GAAATACAGA 


3540 






AGCAGGGGCA 


AGGCCAATGG 


ACATATCAAA 


TTTATCAAGA 


GCCATTTAAA 


AATGTGAAAA 


3600 


CAGGAAAATA 


TGCAAGAATG 


AAGGGTGCCC 


ACACTAATGA 


TGTGAAACAA 


TTAACAGAGG 


3660 


CAGTACAAAA 


AATAGCCACA 


GAAAGCATAG 


TAATATGGGG 


AAAGACTCCT 


AAATTTAAAT 


3720 


TACCCATACA 


AAAGGAAACA 


TGGGAAGCAT 


GGTGGACAGA 


GTATTGGCAA 


GCCACCTGGA 


3780 


TTCCTGAGTG 


GGAGTTTGTC 


AATACCCCTC 


CCTTAGTGAA 


GTTATGGTAC 


CAGTTAGAGA 


3840 


AAGAACCCAT 


AATAGGAGCA 


GAAACTTTCT 


ATGTAGATGG 


GGCAGCCAAT 


AGGGAAACTA 


3900 


AATTAGGAAA 


AGCAGGATAT 


GTAACTGACA 


GAGGAAGACA 


AAAAGTTGTC 


CCCCTAACGG 


3960 


ACACAACAAA 


TCAGAAGACT 


GAGTTACAAG 


CAATTCATCT 


AGCTTTGCAG 


GATTCGGGAT 


4020 


TAGAAGTAAA 


CATAGTGACA 


GACTCACAAT 


ATGCATTGGG 


AATCATTCAA 


GCACAACCAG 


4080 


ATAAGAGTGA 


ATCAGAGTTA 


GTCAGTCAAA 


TAATAGAGCA 


GTTAATAAAA 


AAGGAAAAAG 


4140 


TCTACCTGGC 


ATGGGTACCA 


GCACACAAAG 


GAATTGGAGG 


AAATGAACAA 


GTAGATGGGT 


4200 


TGGTCAGTGC 


TGGAATCAGG 


AAAGTACTAT 


TTTTAGATGG 


AATAGATAAG 


GCCCAAGAAG 


4260 


AACATGAGAA 


ATATCACAGT 


AATTGGAGAG 


CAATGGCTAG 


TGATTTTAAC 


CTACCACCTG 


4320 


TAGTAGCAAA 


AGAAATAGTA 


GCCAGCTGTG 


ATAAATGTCA 


GCTAAAAGGG 


GAAGCCATGC 


4380 


ATGGACAAGT 


AGACTGTAGC 


CCAGGAATAT 


GGCAGCTAGA 


TTGTACACAT 


TTAGAAGGAA 


4440 


AAGTTATCTT 


GGTAGCAGTT 


CATGTAGCCA 


GTGGATATAT 


AGAAGCAGAA 


GTAATTCCAG 


4500 


CAGAGACAGG 


GCAAGAAACA 


GCATACTTCC 


TCTTAAAATT 


AGCAGGAAGA 


TGGCCAGTAA 


4560 


AAACAGTACA 


TACAGACAAT 


GGCAGCAATT 


TCACCAGTAC 


TACAGTTAAG 


GCCGCCTGTT 


4620 


GGTGGGCGGG 


GATCAAGCAG 


GAATTTGGCA 


TTCCCTACAA 


TCCCCAAAGT 


CAAGGAGTAA 


4680 


TAGAATCTAT 


GAATAAAGAA 


TTAAAGAAAA 


TTATAGGACA 


GGTAAGAGAT 


CAGGCTGAAC 


4740 


ATCTTAAGAC 


AGCAGTACAA 


ATGGCAGTAT 


TCATCCACAA 


TTTTAAAAGA 


AAAGGGGGGA 


4800 


TTGGGGGGTA 


CAGTGCAGGG 


GAAAGAATAG 


TAGACATAAT 


AGCAACAGAC 


ATACAAACTA 


4860 


AAGAATTACA 


AAAACAAATT 


ACAAAAATTC 


AAAATTTTCG 


GGTTTATTAC 


AGGGACAGCA 


4920 


GAGATCCAGT 


TTGGAAAGGA 


CCAGCAAAGC 


TCCTCTGGAA 


AGGTGAAGGG 


GCAGTAGTAA 


4980 


TACAAGATAA 


TAGTGACATA 


AAAGTAGTGC 


CAAGAAGAAA 


AGCAAAGATC 


ATCAGGGATT 


5040 


ATGGAAAACA 


GATGGCAGGT 


GATGATTGTG 


TGGCAAGTAG 


ACAGGATGAG 


GATTAACACA 


5100 


TGGAAAAGAT 


TAGTAAAACA 


CCATATGTAT 


ATTTCAAGGA 


AAGCTAAGGA 


CTGGTTTTAT 


5160 


AGACATCACT 


ATGAAAGTAC 


TAATCCAAAA 


AT7AGTTCAG 


AAGTACACAT 


CCCACTAGGG 


5220 


GATGCTAAAT 


TAGTAATAAC 


AACATATTGG 


GGTCTGCATA 


CAGGAGAAAG 


AGACTGGCAT 


5280 


TTGGGTCAGG 


GAGTCTCCAT 


AGAATGGAGG 


AAAAAGAGAT 


ATAGCACACA 


AGTAGACCCT 


5340 
GACCTAGCAG


ACCAACTAAT 


TCATCTGCAC 


TATTTTGATT 


GTTTTTCAGA 


ATCTGCTATA 


5400 


AGAAATACCA 


TATTAGGACG 


TATAGTTAGT 


CCTAGGTGTG 


AATATCAAGC 


AGGACATAAC 


5460 


AAGGTAGGAT 


CTCTACAGTA 


CTTGGCACTA 


GCAGCATTAA 


TAAAACCAAA 


ACAGATAAAG 


5520 


rCACCTTTGC 


CTAGTGTTAG 


GAAACTGACA 


GAGGACAGAT 


GGAACAAGCC 


CCAGAAGACC 


5580 


AAGGGCCACA 


GAGGGAGCCA 


TACAATGAAT 


GGACACTAGA 


GCTTTTAGAG 


GAACTTAAGA 


5640 


GTGAAGCTGT 


TAGACATTTT 


CCTAGGATAT 


GGCTCCATAA 


CTTAGGACAA 


CATATCTATG 


5700 


AAACTTACGG 


GGATACTTGG 


GCAGGAGTGG 


AAGCCATAAT 


AAGAATTCTG 


CAACAACTGC 


5760 


TfiTTTATCCA 


TTTCAGAATT 


GGGTGTCGAC 


ATAGCAGAAT 


AGGCGTTACT 


CGACAGAGGA 


5820 


fiAfirAAGAAA 


TGGAGCCAGT 


AGATCCTAGA 


CTAGAGCCCT 


GGAAGCATCC 


AGGAAGTCAG 


5880 


rPTAAAACTG 


CTTGTACCAA 


TTGCTATTGT 


AAAAAGTGTT 


GCTTTCATTG 


CCAAGTTTGT 


5940 


TTT* A Tfl A P A A 


AAGPCTTAGG 


CATCTCCTAT 


GGCAGGAAGA 


AGCGGAGACA 


GCGACPAAGA 


6000 




APAGTPAGAC 


TCATCAAGCT 


TCTCTATCAA 


AGCAGTAAGT 


AGTACATGTA 


6060 


aTnPA APPTA 


TAATAGTAGC 

X AA X AO X nOx^ 


AATAGTAGCA 


TTAGTAGTAG 


CAATAATAAT 


AGCAATAGTT 


6120 


riTfJTOnTP PA 


TAGTAATCAT 

X X X X 


AGAATATAGG 


AAAATATTAA 


GACAAAGAAA 


AATAGACAGG 


6180 


TTAATHPRATA 


GACTAATAGA 


AAGAGCAGAA 


GACAGTGGCA 


ATGAGAGTGA 


AGGAGAAGTA 


6240 


fpPAfiPAPTTG 


TGGAGATGGG 


GGTGGAAATG 


GGGCACCATG 


CTCCTTGGGA 


TATTGATGAT 


6300 


PTfiTAGTGCT 


ACAGAAAAAT 


TGTGGGTCAC 


AGTCTATTAT 


GGGGTACCTG 


TGTGGAAGGA 


6360 


AfiPAAPPACC 


ACTCTATTTT 


GTGCATCAGA 


TGCTAAAGCA 


TATGATACAG 


AGGTACATAA 


6420 


T/^TTT nnfiP P 

1 O 1 X 1 OVJVJWV^ 


APACATGCCT 


GTGTACCCAC 


AGACCCCAAC 


CCACAAGAAG 


TAGTATTGGT 


6480 


2k a ATrtTnAP A 

AAA X v7 X onv<' A 


GAAAATTTTA 


ACATGTGGAA 


AAATGACATG 


GTAGAACAGA 


TGCATGAGGA 


6540 


X A X AA X V« AO X 


TTATGGGATC 


AAAGCCTAAA 


GCCATGTGTA 


AAATTAACCC 


CACTCTGTGT 


6600 


TanTTTA A Afi 
X Au XXX AAAVJ 


TGPACTGATT 


TGAAGAATGA 


TACTAATACC 


AATAGTAGTA 


GCGGGAGAAT 


6660 


i rvft i uoAvj 


AAAGGAGAGA 

AAAO OxiVJiiVJii 


TAAAAAACTG 

X ************ X 


CTCTTTCAAT 


ATCAGCACAA 


GCATAAGAGA 


6720 


TAAGGTGCAG 


AAAGAATATG 


CATTCTTTTA 


TAAACTTGAT 


ATAGTACCAA 


TAGaTAATAC 




CAGCTATAGG 


TTGATAAGTT 


GTAACACCTC 


AGTCATTACA 


CAGGCCTGTC 


CAAAGGTATC 


6840 


CTTTGAGCCA 


ATTCCCATAC 


ATTATTGTGC 


CCCGGCTGGT 


TTTGCGATTC 


TAAAATGTAA 


6900 


TAATAAGACG 


TTCAATGGAA 


CAGGACCATG 


TACAAATGTC 


AGCACAGTAC 


AATGTACACA 


6960 


TGGAATCAGG 


CCAGTAGTAT 


CAACTCAACT 


GCTGTTAAAT 


GGCAGTCTAG 


CAGAAGAAGA 


7020 


TGTAGTAATT 


AGATCTGCCA 


ATTTCACAGA 


CAATGCTAAA 


ACCATAATAG 


TACAGCTGAA 


7080 


CACATCTGTA 


GAAATTAATT 


GTACAAGACC 


CAACAACAAT 


ACAAGAAAAA 


GTATCCGTAT 


7140 
CCAGAGGGGA


CCAGGGAGAG 


CATTTGTTAC 


AATAGGAAAA 


ATAGGAAATA 


TGAGACAAGC 


7200 


ACATTGTAAC 


ATTAGTAGAG 


CAAAATGGAA 


TGCCACTTTA 


AAACAGATAG 


CTAGCAAATT 


7260 


AAGAGAACAA 


TTTGGAAATA 


ATAAAACAAT 


AATCTTTAAG 


CAATCCTCAG 


GAGGGGACCC 


7320 


AGAAATTGTA 


ACGCACAGTT 


TTAATTGTGG 


AGGGGAATTT 


TTCTACTGTA 


ATTCAACACA 


7380 


ACTGTTTAAT 


AGTACTTGGT 


TTAATAGTAC 


TTGGAGTACT 


GAAGGGTCAA 


ATAACACTGA 


7440 


AGGAAGTGAC 


ACAATCACAC 


TCCCATGCAG 


AATAAAACAA 


TTTATAAACA 


TGTGGCAGGA 


7500 


AGTAGGAAAA 


GCAATGTATG 


CCCCTCCCAT 


CAGTGGACAA 


ATTAGATGTT 


CATCAAATAT 


7560 


TACTGGGCTG 


CTATTAACAA 


GAGATGGTGG 


TAATAACAAC 


AATGGGTCCG 


AGATCTTCAG 


7620 


ACCTGGAGGA 


GGCGATATGA 


GGGACAATTG 


GAGAAGTGAA 


TTATATAAAT 


ATAAAGTAGT 


7680 


AAAAATTGAA 


CCATTAGGAG 


TAGCACCCAC 


CAAGGCAAAG 


AGAAGAGTGG 


TGCAGAGAGA 


7740 


AAAAAGAGCA 


GTGGGAATAG 


GAGCTTTGTT 


CCTTGGGTTC 


TTGGGAGCAG 


CAGGAAGCAC 


7800 


TATGGGCTGC 


ACGTCAATGA 


CGCTGACGGT 


ACAGGCCAGA 


CAATTATTGT 


CTGATATAGT 


7860 


GCAGCAGCAG 


AACAATTTGC 


TGAGGGCTAT 


TGAGGCGCAA 


CAGCATCTGT 


TGCAACTCAC 


7920 


AGTCTGGGGC 


ATCAAACAGC 


TCCAGGCAAG 


AATCCTGGCT 


GTGGAAAGAT 


ACCTAAAGGA 


7980 


TCAACAGCTC CTGGGGATTT GGGGTTGCTC 


TGGAAAACTC 


ATTTGCACCA 


CTGCTGTGCC 


8040 


TTGGAATGCT 


AGTTGGAGTA 


ATAAATCTCT 


GGAACAGATT 


TGGAATAACA 


TGACCTGGAT 


8100 


GGAGTGGGAC 


AGAGAAATTA 


ACAATTACAC 


AAGCTTAATA 


CACTCCTTAA 


TTGAAGAATC 


8160 


GCAAAACCAG 


CAAGAAAAGA 


ATGAACAAGA 


ATTATTGGAA 


TTAGATAAAT 


GGGCAAGTTT 


8220 


GTGGAATTGG 


TTTAACATAA 


CAAATTGGCT 


GTGGTATATA 


AAATTATTCA 


TAATGATAGT 


8280 


AGGAGGCTTG 


GTAGGTTTAA 


GAATAGTTTT 


TGCTGTACTT 


TCTATAGTGA 


ATAGAGTTAG 


8340 


GCAGGGATAT 


TCACCATTAT 


CGTTTCAGAC 


CCACCTCCCA 


ATCCCGAGGG 


GACCCGACAG 


8400 


GCCCGAAGGA 


ATAGAAGAAG 


AAGGTGGAGA 


GAGAGACAGA 


GACAGATCCA 


TTCGATTAGT 


8460 


GAACGGATCC 


TTAGCACTTA 


TCTGGGACGA 


TCTGCGGAGC 


CTGTGCCTCT 


TCAGCTACCA 


8520 


CCGCTTGAGA 


GACTTACTCT 


TGATTGTAAC 


GAGGATTGTG 


GAACTTCTGG 


GACGCAGGGG 


8580 


GTGGGAAGCC 


CTCAAATATT 


GGTGGAATCT 


CCTACAGTAT 


TGGAGTCAGG 


AACTAAAGAA 


8640 


TAGTGCTGTT 


AACTTGCTCA 


ATGCCACAGC CATAGCAGTA GCTGAGGGGA CAGATAGGGT 


8700 


TATAGAAGTA 


TTACAAGCAG 


CTTATAGAGC 


TATTCGCCAC 


ATACCTAGAA 


GAATAAGACA 


8760 


GGGCTTGGAA 


AGGATTTTGC 


TATAAGATGG 


GTGGCAAGTG 


GTCAAAAAGT 


AGTGTGATTG 


8820 


GATGGCCTGC 


TGTAAGGGAA 


AGAATGAGAC 


GAGCTGAGCC 


AGCAGCAGAT 


GGGGTGGGAG 


8880 


CAGTATCTCG 


AGACCTAGAA 


AAACATGGAG 


CAATCACAAG 


TAGCAATACA 


GCAGCTAACA 


8940 
ATGCTGCTTG


TGCCTGGCTA 


GAAGCACAAG 


AGGAGGAAGA 


GGTGGGTTTT 


CCAGTCACAC 


9000 


CTCAGGTACC TTTAAGACCA ATGACTTACA AGGCAGCTGT AGATCTTAGC CACTTTTTAA 9060
AAGAAAAGGG GGGACTGGAA GGGCTAATTC ACTCCCAAAG AAGACAAGAT ATCCTTGATC 9120
TGTGGATCTA CCACACACAA GGCTACTTCC CTGATTGGCA GAACTACACA CCAGGGCCAG 9180
GGGTCAGATA TCCACTGACC TTTGGATGGT GCTACAAGCT AGTACCAGTT GAGCCAGATA 9240
AGGTAGAAGA GGCCAATAAA GGAGAGAACA CCAGCTTGTT ACACCCTGTG AGCCTGCATG 9300
GAATGGATGA CCCTGAGAGA GAAGTGTTAG AGTGGAGGTT TGACAGCCGC CTAGCATTTC 9360
ATCACGTGGC CCGAGAGCTG CATCCGGAGT ACTTCAAGAA CTGCTGACAT CGAGCTTGCT 9420
ACAAGGGACT TTCCGCTGGG GACTTTCCAG GGAGGCGTGG CCTGGGCGGG ACTGGGGAGT 9480
GGCGAGCCCT CAGATGCTGC ATATAAGCAG CTGCTTTTTG CCTGTACTGG GTCTCTCTGG 9540
TTAGACCAGA TCTGAGCCTG GGAGCTCTCT GGCTAACTAG GGAACCCACT GCTTAAGCCT 9600
CAATAAAGCT TGCCTTGAGT GCTTCAAGTA GTGTGTGCCC GTCTGTTGTG TGACTCTGGT 9660
AACTAGAGAT CCCTCAGACC CTTTTAGTCA GTGTGGAAAA TCTCTAGCA 9709



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
ATGGGTGGCA 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
TGGGTGGCAA 
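
Read against SEQ ID NO: 1, the 10-base entries in this listing appear to be successive windows over one stretch of that sequence, each shifted by a single base from the one before. The listing does not state this; it is an observation from the entries themselves. The Python sketch below illustrates the pattern under that assumption. The name REGION, the helper sliding_windows, and the choice to begin numbering at SEQ ID NO: 2 are illustrative and are not part of the filing.

```python
# Illustrative only: REGION is copied from the last four 10-base blocks of the
# SEQ ID NO: 1 row that ends at position 8820 in this listing.
REGION = "TATAAGATGGGTGGCAAGTGGTCAAAAAGTAGTGTGATTG"


def sliding_windows(seq, width=10, start=0):
    """Return every `width`-base window of `seq`, beginning at index `start`
    and advancing one base at a time."""
    return [seq[i:i + width] for i in range(start, len(seq) - width + 1)]


# Assumption: the numbering begins at SEQ ID NO: 2 with the window that
# starts at "ATGGGTGGCA".
first = REGION.index("ATGGGTGGCA")
windows = sliding_windows(REGION, width=10, start=first)

for seq_id, window in enumerate(windows, start=2):
    print(f"SEQ ID NO: {seq_id}  {window}")
```

Each printed window can be checked by eye against the entry with the same SEQ ID NO in the listing.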



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4
GGGTGGCAAG



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 
GGTGGCAAGT 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 
GTGGCAAGTG 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 
TGGCAAGTGG 



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GGCAAGTGGT 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GCAAGTGGTC 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
CAAGTGGTCA 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
AAGTGGTCAA 



(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
AGTGGTCAAA 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
GTGGTCAAAA 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
TGGTCAAAAA 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
GGTCAAAAAG 



(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
GTCAAAAAGT 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17
TCAAAAAGTA 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
CAAAAAGTAG 



(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
AAAAAGTAGT



(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
AAAAGTAGTG 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 
AAAGTAGTGT



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
AAGTAGTGTG 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 
AGTAGTGTGA 



(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 
GTAGTGTGAT 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
TAGTGTGATT 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
AGTGTGATTG 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
GTGTGATTGG 



(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 
TGTGATTGGA 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
GTGATTGGAT 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
TGATTGGATG 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 
GATTGGATGG 



(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 
ATTGGATGGC 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 
TTGGATGGCC 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 
TGGATGGCCT 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 
GGATGGCCTG 



(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 
GATGGCCTGC 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 
ATGGCCTGCT 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 
TGGCCTGCTG 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 
GGCCTGCTGT



(2) INFORMATION FOR SEQ ID NO: 40:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 
GCCTGCTGTA 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 
CCTGCTGTAA



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 
CTGCTGTAAG 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43 
TGCTGTAAGG 



(2) INFORMATION FOR SEQ ID NO: 44:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 
GCTGTAAGGG 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45 
CTGTAAGGGA 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46 
TGTAAGGGAA 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47 

GTAAGGGAAA 



(2) INFORMATION FOR SEQ ID NO: 48:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48 
TAAGGGAAAG 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49 
AAGGGAAAGA 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50 
AGGGAAAGAA 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 
GGGAAAGAAT 



(2) INFORMATION FOR SEQ ID NO: 52:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 
GGAAAGAATG 



(2) INFORMATION FOR SEQ ID NO:53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 
GAAAGAATGA 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 
AAAGAATGAG 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 
AAGAATGAGA 



(2) INFORMATION FOR SEQ ID NO: 56:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
AGAATGAGAC 



(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
GAATGAGACG 



(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
AATGAGACGA 



(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
ATGAGACGAG 



(2) INFORMATION FOR SEQ ID NO: 60:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60
TGAGACGAGC 



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61 
GAGACGAGCT



(2) INFORMATION FOR SEQ ID NO:62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62 
AGACGAGCTG 



(2) INFORMATION FOR SEQ ID NO:63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 
GACGAGCTGA 



(2) INFORMATION FOR SEQ ID NO: 64: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 
ACGAGCTGAG 



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65 
CGAGCTGAGC 



(2) INFORMATION FOR SEQ ID NO:66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66 
GAGCTGAGCC 



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67 

AGCTGAGCCA 



(2) INFORMATION FOR SEQ ID NO: 68:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68 
GCTGAGCCAG 



(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69 
CTGAGCCAGC 



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70 
TGAGCCAGCA 



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71 
GAGCCAGCAG 



(2) INFORMATION FOR SEQ ID NO: 72:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 
AGCCAGCAGC 



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 
GCCAGCAGCA 



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 
CCAGCAGCAG 



(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75 
CAGCAGCAGA 



(2) INFORMATION FOR SEQ ID NO: 76:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76 
AGCAGCAGAT 



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77
GCAGCAGATG 



(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 
CAGCAGATGG 



(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79 
AGCAGATGGG 



(2) INFORMATION FOR SEQ ID NO: 80:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80 
GCAGATGGGG 



(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81 
CAGATGGGGT 



(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82 
AGATGGGGTG 



(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 
GATGGGGTGG 



(2) INFORMATION FOR SEQ ID NO: 84:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
ATGGGGTGGG 



(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
TGGGGTGGGA 



(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86 
GGGGTGGGAG 



(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87 
GGGTGGGAGC 



(2) INFORMATION FOR SEQ ID NO: 88:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88 
GGTGGGAGCA 



(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89 
GTGGGAGCAG 



(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90 
TGGGAGCAGT 



(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91
GGGAGCAGTA 



(2) INFORMATION FOR SEQ ID NO: 92:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 
GGAGCAGTAT 



(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:93: 
GAGCAGTATC 



(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
AGCAGTATCT 



(2) INFORMATION FOR SEQ ID NO:95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:95: 
GCAGTATCTC 



(2) INFORMATION FOR SEQ ID NO: 96:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96 
CAGTATCTCG 



(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97 
AGTATCTCGA 



(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:98 
GTATCTCGAG



(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99 
TATCTCGAGA 



(2) INFORMATION FOR SEQ ID NO: 100:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100 
ATCTCGAGAC 



(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101 
TCTCGAGACC



(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102 
CTCGAGACCT 



(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103 
TCGAGACCTA 



(2) INFORMATION FOR SEQ ID NO: 104:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104 
CGAGACCTAG 



(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105 
GAGACCTAGA 



(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106 
AGACCTAGAA 



(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107 
GACCTAGAAA 



(2) INFORMATION FOR SEQ ID NO: 108:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108 
ACCTAGAAAA 



(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109 
CCTAGAAAAA 



(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110 
CTAGAAAAAC 



(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111 
TAGAAAAACA 



(2) INFORMATION FOR SEQ ID NO: 112:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112 
AGAAAAACAT 



(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113 
GAAAAACATG 



(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114 
AAAAACATGG 



(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115 

AAAACATGGA 



(2) INFORMATION FOR SEQ ID NO: 116:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116 
AAACATGGAG 



(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117 
AACATGGAGC 



(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118 
ACATGGAGCA



(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119 
CATGGAGCAA 



(2) INFORMATION FOR SEQ ID NO: 120: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120 
ATGGAGCAAT 



(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121 
TGGAGCAATC 



(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122
GGAGCAATCA 



(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123 
GAGCAATCAC 



(2) INFORMATION FOR SEQ ID NO: 124: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124 
AGCAATCACA 



(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125 
GCAATCACAA 



(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126 
CAATCACAAG 



(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127 
AATCACAAGT 



(2) INFORMATION FOR SEQ ID NO: 128: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128 
ATCACAAGTA 



(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129 
TCACAAGTAG 



(2) INFORMATION FOR SEQ ID NO: 130:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130 
CACAAGTAGC 



(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131 
ACAAGTAGCA 



(2) INFORMATION FOR SEQ ID NO: 132: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132 
CAAGTAGCAA 



(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133
AAGTAGCAAT 



(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134 
AGTAGCAATA 



(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135 
GTAGCAATAC 



(2) INFORMATION FOR SEQ ID NO: 136: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136 
TAGCAATACA 



(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137 
AGCAATACAG 



(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138 
GCAATACAGC 



(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139 
CAATACAGCA 



(2) INFORMATION FOR SEQ ID NO: 140: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140 
AATACAGCAG 



(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141 
ATACAGCAGC 



(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142 
TACAGCAGCT 



(2) INFORMATION FOR SEQ ID NO: 143:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143 
ACAGCAGCTA 



(2) INFORMATION FOR SEQ ID NO: 144: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144 
CAGCAGCTAA 



(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145 
AGCAGCTAAC 



(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146 
GCAGCTAACA 



(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147 
CAGCTAACAA 



(2) INFORMATION FOR SEQ ID NO: 148: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148 
AGCTAACAAT 



(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149 
GCTAACAATG 



(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150 
CTAACAATGC 



(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151 
TAACAATGCT 



(2) INFORMATION FOR SEQ ID NO: 152: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152 
AACAATGCTG 



(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153 
ACAATGCTGC 



(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154 
CAATGCTGCT 



(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155 
AATGCTGCTT 



(2) INFORMATION FOR SEQ ID NO: 156: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156 
ATGCTGCTTG 



(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157 
TGCTGCTTGT



(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158 
GCTGCTTGTG 



(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159 
CTGCTTGTGC 



(2) INFORMATION FOR SEQ ID NO: 160: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160 
TGCTTGTGCC 



(2) INFORMATION FOR SEQ ID NO: 161:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161 
GCTTGTGCCT 



(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162 
CTTGTGCCTG 



(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163 
TTGTGCCTGG 



(2) INFORMATION FOR SEQ ID NO: 164: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164 
TGTGCCTGGC 



(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165 
GTGCCTGGCT 



(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166 
TGCCTGGCTA 



(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167 
GCCTGGCTAG 



(2) INFORMATION FOR SEQ ID NO: 168: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168 
CCTGGCTAGA 



(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169 
CTGGCTAGAA 



(2) INFORMATION FOR SEQ ID NO: 170: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170 
TGGCTAGAAG



(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171 
GGCTAGAAGC 



(2) INFORMATION FOR SEQ ID NO: 172: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172 
GCTAGAAGCA 



(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173 
CTAGAAGCAC 



(2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174 
TAGAAGCACA 



(2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175 
AGAAGCACAA 



(2) INFORMATION FOR SEQ ID NO: 176: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176 
GAAGCACAAG 



(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177 
AAGCACAAGA 



(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178 
AGCACAAGAG 



(2) INFORMATION FOR SEQ ID NO: 179: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179 
GCACAAGAGG 



(2) INFORMATION FOR SEQ ID NO: 180: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180 
CACAAGAGGA 



(2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181 
ACAAGAGGAG 



(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182 
CAAGAGGAGG 



(2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183 
AAGAGGAGGA 



(2) INFORMATION FOR SEQ ID NO: 184: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184 
AGAGGAGGAA 



(2) INFORMATION FOR SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185 
GAGGAGGAAG 



(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186 
AGGAGGAAGA 



(2) INFORMATION FOR SEQ ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187 
GGAGGAAGAG 



(2) INFORMATION FOR SEQ ID NO: 188: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188 
GAGGAAGAGG 



(2) INFORMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189 
AGGAAGAGGT 



(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190 
GGAAGAGGTG 



(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191 
GAAGAGGTGG 



(2) INFORMATION FOR SEQ ID NO: 192: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192 
AAGAGGTGGG 



(2) INFORMATION FOR SEQ ID NO: 193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193 
AGAGGTGGGT 



(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194 
GAGGTGGGTT 



(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195
AGGTGGGTTT 



(2) INFORMATION FOR SEQ ID NO: 196: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196 
GGTGGGTTTT 



(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197 
GTGGGTTTTC 



(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198 
TGGGTTTTCC 



(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199 
GGGTTTTCCA



(2) INFORMATION FOR SEQ ID NO: 200: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200 
GGTTTTCCAG 



(2) INFORMATION FOR SEQ ID NO: 201: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201: 
GTTTTCCAGT



(2) INFORMATION FOR SEQ ID NO: 202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202: 
TTTTCCAGTC 



(2) INFORMATION FOR SEQ ID NO: 203: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:203: 
TTTCCAGTCA 



(2) INFORMATION FOR SEQ ID NO: 204: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204 
TTCCAGTCAC 



(2) INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205
TCCAGTCACA 



(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206 
CCAGTCACAC 



(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207 
CAGTCACACC 



(2) INFORMATION FOR SEQ ID NO: 208: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:208 
AGTCACACCT 



(2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:209 
GTCACACCTC 



(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210 
TCACACCTCA 



(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211
CACACCTCAG 



(2) INFORMATION FOR SEQ ID NO: 212: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:212 
ACACCTCAGG 



(2) INFORMATION FOR SEQ ID NO: 213:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:213 
CACCTCAGGT 



(2) INFORMATION FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214
ACCTCAGGTA 



(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215 
CCTCAGGTAC 



(2) INFORMATION FOR SEQ ID NO: 216:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216 
CTCAGGTACC 



(2) INFORMATION FOR SEQ ID NO: 217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217 
TCAGGTACCT 



(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218 
CAGGTACCTT 



(2) INFORMATION FOR SEQ ID NO: 219:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219 
AGGTACCTTT 



(2) INFORMATION FOR SEQ ID NO: 220: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220 
GGTACCTTTA 



(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221 
GTACCTTTAA 



(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222 
TACCTTTAAG 



(2) INFORMATION FOR SEQ ID NO:223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223
ACCTTTAAGA 



(2) INFORMATION FOR SEQ ID NO: 224: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224 
CCTTTAAGAC 



(2) INFORMATION FOR SEQ ID NO: 225: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:225 
CTTTAAGACC 



(2) INFORMATION FOR SEQ ID NO: 226:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226 
TTTAAGACCA 



(2) INFORMATION FOR SEQ ID NO:227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227 
TTAAGACCAA



(2) INFORMATION FOR SEQ ID NO: 228: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:228 
TAAGACCAAT 



(2) INFORMATION FOR SEQ ID NO: 229: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229 
AAGACCAATG 



(2) INFORMATION FOR SEQ ID NO: 230: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230 
AGACCAATGA 



(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231
GACCAATGAC 



(2) INFORMATION FOR SEQ ID NO: 232: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232 
ACCAATGACT 



(2) INFORMATION FOR SEQ ID NO: 233:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233
CCAATGACTT 



(2) INFORMATION FOR SEQ ID NO:234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234 
CAATGACTTA 



(2) INFORMATION FOR SEQ ID NO: 235: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235 
AATGACTTAC 



(2) INFORMATION FOR SEQ ID NO: 236:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236
ATGACTTACA 



(2) INFORMATION FOR SEQ ID NO: 237:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237 
TGACTTACAA 



(2) INFORMATION FOR SEQ ID NO: 238: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:238 
GACTTACAAG 



(2) INFORMATION FOR SEQ ID NO: 239: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:239 
ACTTACAAGG 



(2) INFORMATION FOR SEQ ID NO: 240:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:240 
CTTACAAGGC 



(2) INFORMATION FOR SEQ ID NO:241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241 
TTACAAGGCA 



(2) INFORMATION FOR SEQ ID NO:242: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:242 
TACAAGGCAG 



(2) INFORMATION FOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243
ACAAGGCAGC



(2) INFORMATION FOR SEQ ID NO: 244:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244 
CAAGGCAGCT 



(2) INFORMATION FOR SEQ ID NO:245: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:245 
AAGGCAGCTG 



(2) INFORMATION FOR SEQ ID NO: 246: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246 
AGGCAGCTGT 



(2) INFORMATION FOR SEQ ID NO: 247: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:247 
GGCAGCTGTA 



(2) INFORMATION FOR SEQ ID NO: 248:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248 
GCAGCTGTAG 



(2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249 
CAGCTGTAGA 



(2) INFORMATION FOR SEQ ID NO:250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250 
AGCTGTAGAT 



(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:251 
GCTGTAGATC 



(2) INFORMATION FOR SEQ ID NO:252:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252 
CTGTAGATCT 



(2) INFORMATION FOR SEQ ID NO: 253: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:253 
TGTAGATCTT 



(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254 
GTAGATCTTA



(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255 
TAGATCTTAG 



(2) INFORMATION FOR SEQ ID NO:256:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256 
AGATCTTAGC 



(2) INFORMATION FOR SEQ ID NO:257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:257: 
GATCTTAGCC 



(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 
ATCTTAGCCA 



(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 
TCTTAGCCAC 



(2) INFORMATION FOR SEQ ID NO: 260:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:260 
CTTAGCCACT 



(2) INFORMATION FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261 
TTAGCCACTT 



(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262 
TAGCCACTTT 



(2) INFORMATION FOR SEQ ID NO: 263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263 
AGCCACTTTT 



(2) INFORMATION FOR SEQ ID NO:264:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264 
GCCACTTTTT 



(2) INFORMATION FOR SEQ ID NO: 265: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265 
CCACTTTTTA 



(2) INFORMATION FOR SEQ ID NO:266: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266 
CACTTTTTAA



(2) INFORMATION FOR SEQ ID NO:267: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:267 
ACTTTTTAAA 



(2) INFORMATION FOR SEQ ID NO: 268:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:268 
CTTTTTAAAA 



(2) INFORMATION FOR SEQ ID NO:269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269 
TTTTTAAAAG 



(2) INFORMATION FOR SEQ ID NO:270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270 
TTTTAAAAGA 



(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271 
TTTAAAAGAA 



(2) INFORMATION FOR SEQ ID NO: 272:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272 
TTAAAAGAAA 



(2) INFORMATION FOR SEQ ID NO:273: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273 
TAAAAGAAAA 



(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274 
AAAAGAAAAG 



(2) INFORMATION FOR SEQ ID NO: 275: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275 
AAAGAAAAGG 



(2) INFORMATION FOR SEQ ID NO: 276:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276 
AAGAAAAGGG 



(2) INFORMATION FOR SEQ ID NO: 277: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277 
AGAAAAGGGG 



(2) INFORMATION FOR SEQ ID NO:278: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278 
GAAAAGGGGG 



(2) INFORMATION FOR SEQ ID NO: 279:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279 
AAAAGGGGGG 



(2) INFORMATION FOR SEQ ID NO: 280:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280 
AAAGGGGGGA 



(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:281 
AAGGGGGGAC 



(2) INFORMATION FOR SEQ ID NO: 282: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282 
AGGGGGGACT 



(2) INFORMATION FOR SEQ ID NO:283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283 
GGGGGGACTG 



(2) INFORMATION FOR SEQ ID NO:284:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284 
GGGGGACTGG 



(2) INFORMATION FOR SEQ ID NO: 285: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285 
GGGGACTGGA 



(2) INFORMATION FOR SEQ ID NO: 286: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286 
GGGACTGGAA 



(2) INFORMATION FOR SEQ ID NO: 287: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287 
GGACTGGAAG 



(2) INFORMATION FOR SEQ ID NO:288:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:288 
GACTGGAAGG 



(2) INFORMATION FOR SEQ ID NO: 289: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289 
ACTGGAAGGG



(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290 
CTGGAAGGGC 



(2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291 
TGGAAGGGCT 



(2) INFORMATION FOR SEQ ID NO: 292:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292 
GGAAGGGCTA 



(2) INFORMATION FOR SEQ ID NO: 293:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:293: 

GAAGGGCTAA 



(2) INFORMATION FOR SEQ ID NO: 294: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294: 
AAGGGCTAAT 



(2) INFORMATION FOR SEQ ID NO: 295: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 
AGGGCTAATT 



(2) INFORMATION FOR SEQ ID NO: 296:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:296 
GGGCTAATTC 



(2) INFORMATION FOR SEQ ID NO: 297: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: 
GGCTAATTCA 



(2) INFORMATION FOR SEQ ID NO: 298:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298: 
GCTAATTCAC 



(2) INFORMATION FOR SEQ ID NO: 299: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299: 
CTAATTCACT



(2) INFORMATION FOR SEQ ID NO: 300:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300 
TAATTCACTC 



(2) INFORMATION FOR SEQ ID NO: 301: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301 
AATTCACTCC 



(2) INFORMATION FOR SEQ ID NO: 302: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302 
ATTCACTCCC 



(2) INFORMATION FOR SEQ ID NO: 303: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303 
TTCACTCCCA 



(2) INFORMATION FOR SEQ ID NO: 304:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304 
TCACTCCCAA 



(2) INFORMATION FOR SEQ ID NO: 305:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305 
CACTCCCAAA 



(2) INFORMATION FOR SEQ ID NO: 306: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306 
ACTCCCAAAG 



(2) INFORMATION FOR SEQ ID NO: 307: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307 
CTCCCAAAGA 



(2) INFORMATION FOR SEQ ID NO: 308:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308 
TCCCAAAGAA 



(2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309 
CCCAAAGAAG 



(2) INFORMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310 
CCAAAGAAGA 



(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311 
CAAAGAAGAC



(2) INFORMATION FOR SEQ ID NO: 312:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 
AAAGAAGACA 



(2) INFORMATION FOR SEQ ID NO:313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 
AAGAAGACAA 



(2) INFORMATION FOR SEQ ID NO: 314: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:314: 
AGAAGACAAG 



(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 
GAAGACAAGA 



(2) INFORMATION FOR SEQ ID NO: 316:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316 
AAGACAAGAT 



(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317 
AGACAAGATA 



(2) INFORMATION FOR SEQ ID NO: 318: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318 
GACAAGATAT 



(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319 
ACAAGATATC 



(2) INFORMATION FOR SEQ ID NO: 320: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:320 
CAAGATATCC 



(2) INFORMATION FOR SEQ ID NO: 321: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321 
AAGATATCCT 



(2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322 
AGATATCCTT 



(2) INFORMATION FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323 
GATATCCTTG 



(2) INFORMATION FOR SEQ ID NO: 324:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324 
ATATCCTTGA 



(2) INFORMATION FOR SEQ ID NO: 325: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325 
TATCCTTGAT 



(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326 
ATCCTTGATC 



(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327 
TCCTTGATCT 



(2) INFORMATION FOR SEQ ID NO: 328:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328 
CCTTGATCTG 



(2) INFORMATION FOR SEQ ID NO: 329: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329 
CTTGATCTGT 



(2) INFORMATION FOR SEQ ID NO: 330: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:330 
TTGATCTGTG 



(2) INFORMATION FOR SEQ ID NO: 331: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331 
TGATCTGTGG 



(2) INFORMATION FOR SEQ ID NO: 332:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332 
GATCTGTGGA 



(2) INFORMATION FOR SEQ ID NO: 333: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333 
ATCTGTGGAT 



(2) INFORMATION FOR SEQ ID NO: 334: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334
TCTGTGGATC 



(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335 
CTGTGGATCT 



(2) INFORMATION FOR SEQ ID NO: 336:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336 
TGTGGATCTA 



(2) INFORMATION FOR SEQ ID NO: 337: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337 
GTGGATCTAC 



(2) INFORMATION FOR SEQ ID NO: 338: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338 
TGGATCTACC 



(2) INFORMATION FOR SEQ ID NO: 339: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339 
GGATCTACCA 



(2) INFORMATION FOR SEQ ID NO: 340:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340 
GATCTACCAC 



(2) INFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341 
ATCTACCACA 



(2) INFORMATION FOR SEQ ID NO: 342: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342 
TCTACCACAC 



(2) INFORMATION FOR SEQ ID NO: 343:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343 
CTACCACACA 



(2) INFORMATION FOR SEQ ID NO: 344:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344 
TACCACACAC 



(2) INFORMATION FOR SEQ ID NO: 345: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345: 
ACCACACACA 



(2) INFORMATION FOR SEQ ID NO: 346: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 
CCACACACAA 



(2) INFORMATION FOR SEQ ID NO: 347: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347: 
CACACACAAG 



(2) INFORMATION FOR SEQ ID NO:348:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348 
ACACACAAGG 



(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 
CACACAAGGC 



(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 
ACACAAGGCT 



(2) INFORMATION FOR SEQ ID NO: 351: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 
CACAAGGCTA 



(2) INFORMATION FOR SEQ ID NO: 352:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352 
ACAAGGCTAC 



(2) INFORMATION FOR SEQ ID NO:353: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:353 
CAAGGCTACT 



(2) INFORMATION FOR SEQ ID NO: 354:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:354 
AAGGCTACTT



(2) INFORMATION FOR SEQ ID NO: 355: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355 
AGGCTACTTC 



(2) INFORMATION FOR SEQ ID NO: 356: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356 
GGCTACTTCC 



(2) INFORMATION FOR SEQ ID NO: 357: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357 
GCTACTTCCC 



(2) INFORMATION FOR SEQ ID NO: 358: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358 
CTACTTCCCT 



(2) INFORMATION FOR SEQ ID NO: 359: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359 
TACTTCCCTG 



(2) INFORMATION FOR SEQ ID NO: 360: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360 
ACTTCCCTGA 



(2) INFORMATION FOR SEQ ID NO: 361: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361 
CTTCCCTGAT 



(2) INFORMATION FOR SEQ ID NO: 362: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362 
TTCCCTGATT 



(2) INFORMATION FOR SEQ ID NO: 363: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363 
TCCCTGATTG 



(2) INFORMATION FOR SEQ ID NO: 364: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364
CCCTGATTGG 



(2) INFORMATION FOR SEQ ID NO: 365: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365 
CCTGATTGGC 



(2) INFORMATION FOR SEQ ID NO: 366: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366 
CTGATTGGCA 



(2) INFORMATION FOR SEQ ID NO: 367: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367 
TGATTGGCAG 



(2) INFORMATION FOR SEQ ID NO: 368: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368 
GATTGGCAGA 



(2) INFORMATION FOR SEQ ID NO: 369: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369 
ATTGGCAGAA 



(2) INFORMATION FOR SEQ ID NO: 370: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370 
TTGGCAGAAC 



(2) INFORMATION FOR SEQ ID NO: 371: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371 
TGGCAGAACT 



(2) INFORMATION FOR SEQ ID NO: 372: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372 
GGCAGAACTA 



(2) INFORMATION FOR SEQ ID NO: 373: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373 
GCAGAACTAC 



(2) INFORMATION FOR SEQ ID NO: 374: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374 
CAGAACTACA 



(2) INFORMATION FOR SEQ ID NO: 375: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375 
AGAACTACAC 



(2) INFORMATION FOR SEQ ID NO: 376: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376 
GAACTACACA 



(2) INFORMATION FOR SEQ ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377 
AACTACACAC 



(2) INFORMATION FOR SEQ ID NO: 378: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378 
ACTACACACC 



(2) INFORMATION FOR SEQ ID NO:379: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379 
CTACACACCA 



(2) INFORMATION FOR SEQ ID NO: 380: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380 
TACACACCAG 



(2) INFORMATION FOR SEQ ID NO: 381: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381 
ACACACCAGG 



(2) INFORMATION FOR SEQ ID NO: 382: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382
CACACCAGGG 



(2) INFORMATION FOR SEQ ID NO: 383: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383
ACACCAGGGC 



(2) INFORMATION FOR SEQ ID NO: 384: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384
CACCAGGGCC 



(2) INFORMATION FOR SEQ ID NO: 385:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385 
ACCAGGGCCA 



(2) INFORMATION FOR SEQ ID NO: 386: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386 
CCAGGGCCAG 



(2) INFORMATION FOR SEQ ID NO: 387: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387 
CAGGGCCAGG 



(2) INFORMATION FOR SEQ ID NO: 388: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388 
AGGGCCAGGG 



(2) INFORMATION FOR SEQ ID NO: 389: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389 
GGGCCAGGGG 



(2) INFORMATION FOR SEQ ID NO: 390: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390 
GGCCAGGGGT 



(2) INFORMATION FOR SEQ ID NO: 391: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:391 
GCCAGGGGTC 



(2) INFORMATION FOR SEQ ID NO: 392: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392 
CCAGGGGTCA 



(2) INFORMATION FOR SEQ ID NO: 393: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393 
CAGGGGTCAG 



(2) INFORMATION FOR SEQ ID NO: 394: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394 
AGGGGTCAGA 



(2) INFORMATION FOR SEQ ID NO: 395: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395 
GGGGTCAGAT 



(2) INFORMATION FOR SEQ ID NO: 396: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396 
GGGTCAGATA 



(2) INFORMATION FOR SEQ ID NO: 397: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397 
GGTCAGATAT 



(2) INFORMATION FOR SEQ ID NO: 398: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398 
GTCAGATATC 



(2) INFORMATION FOR SEQ ID NO: 399: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399 
TCAGATATCC 



(2) INFORMATION FOR SEQ ID NO: 400: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400 
CAGATATCCA 



(2) INFORMATION FOR SEQ ID NO: 401: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401 
AGATATCCAC 



(2) INFORMATION FOR SEQ ID NO: 402: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402 
GATATCCACT 



(2) INFORMATION FOR SEQ ID NO: 403: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:403 
ATATCCACTG 



(2) INFORMATION FOR SEQ ID NO: 404:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404 
TATCCACTGA 



(2) INFORMATION FOR SEQ ID NO: 405: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405 
ATCCACTGAC 



(2) INFORMATION FOR SEQ ID NO: 406:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406
TCCACTGACC



(2) INFORMATION FOR SEQ ID NO: 407: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407 
CCACTGACCT 



(2) INFORMATION FOR SEQ ID NO: 408: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408 
CACTGACCTT 



(2) INFORMATION FOR SEQ ID NO:409: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:409 
ACTGACCTTT 



(2) INFORMATION FOR SEQ ID NO: 410: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410 
CTGACCTTTG 



(2) INFORMATION FOR SEQ ID NO: 411: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411 

TGACCTTTGG 



(2) INFORMATION FOR SEQ ID NO: 412:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412
GACCTTTGGA 



(2) INFORMATION FOR SEQ ID NO: 413:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413 
ACCTTTGGAT 



(2) INFORMATION FOR SEQ ID NO: 414: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414 
CCTTTGGATG 



(2) INFORMATION FOR SEQ ID NO:415: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:415 
CTTTGGATGG 



(2) INFORMATION FOR SEQ ID NO: 416: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416 
TTTGGATGGT 



(2) INFORMATION FOR SEQ ID NO: 417: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:417 
TTGGATGGTG



(2) INFORMATION FOR SEQ ID NO: 418: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:418 
TGGATGGTGC 



(2) INFORMATION FOR SEQ ID NO:419: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419
GGATGGTGCT 



(2) INFORMATION FOR SEQ ID NO:420: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:420 
GATGGTGCTA 



(2) INFORMATION FOR SEQ ID NO:421: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:421 
ATGGTGCTAC 



(2) INFORMATION FOR SEQ ID NO: 422: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422 
TGGTGCTACA 



(2) INFORMATION FOR SEQ ID NO: 423:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423 
GGTGCTACAA 



(2) INFORMATION FOR SEQ ID NO: 424: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424 
GTGCTACAAG 



(2) INFORMATION FOR SEQ ID NO: 425: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425 
TGCTACAAGC 



(2) INFORMATION FOR SEQ ID NO: 426: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426 
GCTACAAGCT 



(2) INFORMATION FOR SEQ ID NO: 427: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427 
CTACAAGCTA 



(2) INFORMATION FOR SEQ ID NO: 428: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428 
TACAAGCTAG 



(2) INFORMATION FOR SEQ ID NO: 429: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:429 
ACAAGCTAGT 



(2) INFORMATION FOR SEQ ID NO: 430: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430 
CAAGCTAGTA 



(2) INFORMATION FOR SEQ ID NO: 431: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431 
AAGCTAGTAC 



(2) INFORMATION FOR SEQ ID NO: 432: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432 
AGCTAGTACC 



(2) INFORMATION FOR SEQ ID NO:433: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433 
GCTAGTACCA 



(2) INFORMATION FOR SEQ ID NO: 434: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434 
CTAGTACCAG 



(2) INFORMATION FOR SEQ ID NO: 435: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435 
TAGTACCAGT 



(2) INFORMATION FOR SEQ ID NO: 436: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436 
AGTACCAGTT 



(2) INFORMATION FOR SEQ ID NO: 437: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437 
GTACCAGTTG 



(2) INFORMATION FOR SEQ ID NO: 438: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438 
TACCAGTTGA 



(2) INFORMATION FOR SEQ ID NO: 439: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439 
ACCAGTTGAG 



(2) INFORMATION FOR SEQ ID NO: 440: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440 
CCAGTTGAGC 



(2) INFORMATION FOR SEQ ID NO:441: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441 
CAGTTGAGCC 



(2) INFORMATION FOR SEQ ID NO:442: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442 
AGTTGAGCCA 



(2) INFORMATION FOR SEQ ID NO: 443: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443 
GTTGAGCCAG 



(2) INFORMATION FOR SEQ ID NO: 444: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444 
TTGAGCCAGA 



(2) INFORMATION FOR SEQ ID NO:445: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:445 
TGAGCCAGAT 



(2) INFORMATION FOR SEQ ID NO: 446: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446 
GAGCCAGATA 



(2) INFORMATION FOR SEQ ID NO: 447: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:447 
AGCCAGATAA 



(2) INFORMATION FOR SEQ ID NO: 448: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448 
GCCAGATAAG 



(2) INFORMATION FOR SEQ ID NO: 449: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449 
CCAGATAAGG 



(2) INFORMATION FOR SEQ ID NO: 450: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450 
CAGATAAGGT 



(2) INFORMATION FOR SEQ ID NO: 451: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451 
AGATAAGGTA 



(2) INFORMATION FOR SEQ ID NO: 452:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452 
GATAAGGTAG 



(2) INFORMATION FOR SEQ ID NO: 453: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453 
ATAAGGTAGA 



(2) INFORMATION FOR SEQ ID NO:454: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:454 
TAAGGTAGAA 



(2) INFORMATION FOR SEQ ID NO: 455: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455 
AAGGTAGAAG 



(2) INFORMATION FOR SEQ ID NO:456: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456 
AGGTAGAAGA 



(2) INFORMATION FOR SEQ ID NO: 457: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457 
GGTAGAAGAG 



(2) INFORMATION FOR SEQ ID NO: 458:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458 
GTAGAAGAGG 



(2) INFORMATION FOR SEQ ID NO: 459: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459 
TAGAAGAGGC 



(2) INFORMATION FOR SEQ ID NO: 460: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:460 
AGAAGAGGCC 



(2) INFORMATION FOR SEQ ID NO: 461: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 461 
GAAGAGGCCA 



(2) INFORMATION FOR SEQ ID NO: 462: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462 
AAGAGGCCAA 



(2) INFORMATION FOR SEQ ID NO:463: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:463 
AGAGGCCAAT 



(2) INFORMATION FOR SEQ ID NO: 464: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464 
GAGGCCAATA 



(2) INFORMATION FOR SEQ ID NO: 465: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465 
AGGCCAATAA 



(2) INFORMATION FOR SEQ ID NO: 466: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:466 
GGCCAATAAA 



(2) INFORMATION FOR SEQ ID NO: 467: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467 
GCCAATAAAG 



(2) INFORMATION FOR SEQ ID NO: 468: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468 
CCAATAAAGG 



(2) INFORMATION FOR SEQ ID NO: 469:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:469 
CAATAAAGGA 



(2) INFORMATION FOR SEQ ID NO: 470: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:470 
AATAAAGGAG 



(2) INFORMATION FOR SEQ ID NO:471: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:471 
ATAAAGGAGA 



(2) INFORMATION FOR SEQ ID NO: 472: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472 
TAAAGGAGAG 



(2) INFORMATION FOR SEQ ID NO: 473: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473 
AAAGGAGAGA 



(2) INFORMATION FOR SEQ ID NO:474: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 474 
AAGGAGAGAA 



(2) INFORMATION FOR SEQ ID NO: 475: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475 
AGGAGAGAAC 



(2) INFORMATION FOR SEQ ID NO: 476:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476 
GGAGAGAACA 



(2) INFORMATION FOR SEQ ID NO: 477: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477 
GAGAGAACAC 



(2) INFORMATION FOR SEQ ID NO:478: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478 
AGAGAACACC



(2) INFORMATION FOR SEQ ID NO:479: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:479 
GAGAACACCA 



(2) INFORMATION FOR SEQ ID NO: 480:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 480 
AGAACACCAG 



(2) INFORMATION FOR SEQ ID NO: 481: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:481 
GAACACCAGC 



(2) INFORMATION FOR SEQ ID NO: 482: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482
AACACCAGCT 



(2) INFORMATION FOR SEQ ID NO: 483:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 483 
ACACCAGCTT 



(2) INFORMATION FOR SEQ ID NO: 484:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 484 
CACCAGCTTG 



(2) INFORMATION FOR SEQ ID NO: 485: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 485 
ACCAGCTTGT 



(2) INFORMATION FOR SEQ ID NO: 486: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:486: 
CCAGCTTGTT 



(2) INFORMATION FOR SEQ ID NO: 487: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:487: 
CAGCTTGTTA 



(2) INFORMATION FOR SEQ ID NO: 488:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:488 
AGCTTGTTAC 



(2) INFORMATION FOR SEQ ID NO: 489: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:489 
GCTTGTTACA 



(2) INFORMATION FOR SEQ ID NO: 490: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 490 
CTTGTTACAC 



(2) INFORMATION FOR SEQ ID NO: 491: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:491 
TTGTTACACC 



(2) INFORMATION FOR SEQ ID NO:492:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 492
TGTTACACCC 



(2) INFORMATION FOR SEQ ID NO: 493:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:493 
GTTACACCCT 



(2) INFORMATION FOR SEQ ID NO: 494:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494 
TTACACCCTG 



(2) INFORMATION FOR SEQ ID NO:495: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 495
TACACCCTGT



(2) INFORMATION FOR SEQ ID NO: 496:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496 
ACACCCTGTG 



(2) INFORMATION FOR SEQ ID NO: 497:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 497 
CACCCTGTGA 



(2) INFORMATION FOR SEQ ID NO: 498: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:498 
ACCCTGTGAG 



(2) INFORMATION FOR SEQ ID NO: 499: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499 
CCCTGTGAGC 



(2) INFORMATION FOR SEQ ID NO: 500:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 500 
CCTGTGAGCC 



(2) INFORMATION FOR SEQ ID NO: 501: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501 
CTGTGAGCCT 



(2) INFORMATION FOR SEQ ID NO: 502: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502 
TGTGAGCCTG



(2) INFORMATION FOR SEQ ID NO: 503: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503 
GTGAGCCTGC 



(2) INFORMATION FOR SEQ ID NO: 504:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504 
TGAGCCTGCA 



(2) INFORMATION FOR SEQ ID NO:505: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 505 
GAGCCTGCAT 



(2) INFORMATION FOR SEQ ID NO: 506: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506 
AGCCTGCATG 



(2) INFORMATION FOR SEQ ID NO: 507: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507 
GCCTGCATGG 



(2) INFORMATION FOR SEQ ID NO: 508:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508 
CCTGCATGGA 



(2) INFORMATION FOR SEQ ID NO: 509: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509 
CTGCATGGAA 



(2) INFORMATION FOR SEQ ID NO: 510: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510 
TGCATGGAAT



(2) INFORMATION FOR SEQ ID NO: 511: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 511 
GCATGGAATG 



(2) INFORMATION FOR SEQ ID NO: 512:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512 
CATGGAATGG 



(2) INFORMATION FOR SEQ ID NO: 513: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 513 
ATGGAATGGA 



(2) INFORMATION FOR SEQ ID NO: 514: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514 
TGGAATGGAT 



(2) INFORMATION FOR SEQ ID NO: 515: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515 
GGAATGGATG 



(2) INFORMATION FOR SEQ ID NO: 516:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 516 
GAATGGATGA 



(2) INFORMATION FOR SEQ ID NO: 517: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517: 
AATGGATGAC



(2) INFORMATION FOR SEQ ID NO: 518: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518: 
ATGGATGACC 



(2) INFORMATION FOR SEQ ID NO: 519: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519: 
TGGATGACCC 



(2) INFORMATION FOR SEQ ID NO: 520:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:520 
GGATGACCCT 



(2) INFORMATION FOR SEQ ID NO: 521: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521 
GATGACCCTG 



(2) INFORMATION FOR SEQ ID NO: 522: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522 
ATGACCCTGA 



(2) INFORMATION FOR SEQ ID NO: 523: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 523 
TGACCCTGAG 



(2) INFORMATION FOR SEQ ID NO: 524:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524 
GACCCTGAGA 



(2) INFORMATION FOR SEQ ID NO: 525:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525 
ACCCTGAGAG 



(2) INFORMATION FOR SEQ ID NO: 526: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526 
CCCTGAGAGA 



(2) INFORMATION FOR SEQ ID NO: 527:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527 
CCTGAGAGAG 



(2) INFORMATION FOR SEQ ID NO: 528: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 528 
CTGAGAGAGA 



(2) INFORMATION FOR SEQ ID NO: 529: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 529 
TGAGAGAGAA 



(2) INFORMATION FOR SEQ ID NO: 530: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530 
GAGAGAGAAG 



(2) INFORMATION FOR SEQ ID NO: 531: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531 
AGAGAGAAGT 



(2) INFORMATION FOR SEQ ID NO: 532: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532 
GAGAGAAGTG 



(2) INFORMATION FOR SEQ ID NO: 533:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 533
AGAGAAGTGT 



(2) INFORMATION FOR SEQ ID NO: 534: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 534 
GAGAAGTGTT 



(2) INFORMATION FOR SEQ ID NO: 535: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 535 
AGAAGTGTTA 



(2) INFORMATION FOR SEQ ID NO: 536:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 536 
GAAGTGTTAG 



(2) INFORMATION FOR SEQ ID NO: 537: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 537 
AAGTGTTAGA 



(2) INFORMATION FOR SEQ ID NO: 538: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 538 
AGTGTTAGAG 



(2) INFORMATION FOR SEQ ID NO: 539: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 539 
GTGTTAGAGT



(2) INFORMATION FOR SEQ ID NO: 540:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 540 
TGTTAGAGTG 



(2) INFORMATION FOR SEQ ID NO: 541: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 541 
GTTAGAGTGG 



(2) INFORMATION FOR SEQ ID NO: 542: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542 
TTAGAGTGGA 



(2) INFORMATION FOR SEQ ID NO: 543: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 543 
TAGAGTGGAG 



(2) INFORMATION FOR SEQ ID NO: 544: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 544 
AGAGTGGAGG 



(2) INFORMATION FOR SEQ ID NO: 545: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 545:
GAGTGGAGGT 



(2) INFORMATION FOR SEQ ID NO: 546: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 546: 
AGTGGAGGTT 



(2) INFORMATION FOR SEQ ID NO: 547: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547: 
GTGGAGGTTT 



(2) INFORMATION FOR SEQ ID NO: 548:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 548 
TGGAGGTTTG 



(2) INFORMATION FOR SEQ ID NO:549: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 549 
GGAGGTTTGA 



(2) INFORMATION FOR SEQ ID NO: 550: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 550 
GAGGTTTGAC 



(2) INFORMATION FOR SEQ ID NO: 551: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 551 
AGGTTTGACA 



(2) INFORMATION FOR SEQ ID NO: 552:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 552 
GGTTTGACAG 



(2) INFORMATION FOR SEQ ID NO:553: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 553 
GTTTGACAGC 



(2) INFORMATION FOR SEQ ID NO:554: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:554 
TTTGACAGCC 



(2) INFORMATION FOR SEQ ID NO: 555: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 555 
TTGACAGCCG 



(2) INFORMATION FOR SEQ ID NO: 556:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 556 
TGACAGCCGC 



(2) INFORMATION FOR SEQ ID NO: 557: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 557 
GACAGCCGCC 



(2) INFORMATION FOR SEQ ID NO: 558: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 558 
ACAGCCGCCT 



(2) INFORMATION FOR SEQ ID NO: 559: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559 
CAGCCGCCTA 



(2) INFORMATION FOR SEQ ID NO: 560:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 560
AGCCGCCTAG 



(2) INFORMATION FOR SEQ ID NO: 561: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 561 
GCCGCCTAGC 



(2) INFORMATION FOR SEQ ID NO: 562: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 562 
CCGCCTAGCA 



(2) INFORMATION FOR SEQ ID NO: 563: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 563 
CGCCTAGCAT 



(2) INFORMATION FOR SEQ ID NO: 564:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 564 
GCCTAGCATT 



(2) INFORMATION FOR SEQ ID NO: 565: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 565 
CCTAGCATTT 



(2) INFORMATION FOR SEQ ID NO: 566: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 566 
CTAGCATTTC 



(2) INFORMATION FOR SEQ ID NO: 567: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 567 
TAGCATTTCA 



(2) INFORMATION FOR SEQ ID NO: 568:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 568 
AGCATTTCAT 



(2) INFORMATION FOR SEQ ID NO:569:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 569 
GCATTTCATC 



(2) INFORMATION FOR SEQ ID NO: 570: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 570 
CATTTCATCA 



(2) INFORMATION FOR SEQ ID NO: 571: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 571 
ATTTCATCAC 



(2) INFORMATION FOR SEQ ID NO: 572:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 572 
TTTCATCACG 



(2) INFORMATION FOR SEQ ID NO: 573: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 573
TTCATCACGT



(2) INFORMATION FOR SEQ ID NO: 574: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 574 
TCATCACGTG 



(2) INFORMATION FOR SEQ ID NO: 575: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 575 
CATCACGTGG 



(2) INFORMATION FOR SEQ ID NO: 576:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 576 
ATCACGTGGC 



(2) INFORMATION FOR SEQ ID NO: 577: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 577 
TCACGTGGCC 



(2) INFORMATION FOR SEQ ID NO: 578: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 578 
CACGTGGCCC 



(2) INFORMATION FOR SEQ ID NO: 579: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 579 
ACGTGGCCCG 



(2) INFORMATION FOR SEQ ID NO: 580:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:580 
CGTGGCCCGA 



(2) INFORMATION FOR SEQ ID NO: 581: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 581 
GTGGCCCGAG 



(2) INFORMATION FOR SEQ ID NO: 582: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 582 
TGGCCCGAGA 



(2) INFORMATION FOR SEQ ID NO: 583: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 583 
GGCCCGAGAG 



(2) INFORMATION FOR SEQ ID NO: 584:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 584 
GCCCGAGAGC 



(2) INFORMATION FOR SEQ ID NO: 585: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 585 
CCCGAGAGCT 



(2) INFORMATION FOR SEQ ID NO: 586: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 586 
CCGAGAGCTG 



(2) INFORMATION FOR SEQ ID NO: 587: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 587 
CGAGAGCTGC 



(2) INFORMATION FOR SEQ ID NO: 588:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 588 
GAGAGCTGCA 



(2) INFORMATION FOR SEQ ID NO: 589: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 589 
AGAGCTGCAT 



(2) INFORMATION FOR SEQ ID NO: 590: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 590 
GAGCTGCATC 



(2) INFORMATION FOR SEQ ID NO: 591: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 591 
AGCTGCATCC 



(2) INFORMATION FOR SEQ ID NO: 592:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 592 
GCTGCATCCG 



(2) INFORMATION FOR SEQ ID NO: 593: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 593 
CTGCATCCGG 



(2) INFORMATION FOR SEQ ID NO: 594: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 594 
TGCATCCGGA 



(2) INFORMATION FOR SEQ ID NO: 595: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 595
GCATCCGGAG



(2) INFORMATION FOR SEQ ID NO: 596: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 596 
CATCCGGAGT 



(2) INFORMATION FOR SEQ ID NO: 597: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 597 
ATCCGGAGTA 



(2) INFORMATION FOR SEQ ID NO: 598: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 598
TCCGGAGTAC 



(2) INFORMATION FOR SEQ ID NO: 599: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 599 
CCGGAGTACT 



(2) INFORMATION FOR SEQ ID NO: 600: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 600 
CGGAGTACTT 



(2) INFORMATION FOR SEQ ID NO: 601: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 601 
GGAGTACTTC 



(2) INFORMATION FOR SEQ ID NO: 602: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 602 
GAGTACTTCA



(2) INFORMATION FOR SEQ ID NO: 603: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 603 
AGTACTTCAA 



(2) INFORMATION FOR SEQ ID NO: 604: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 604 
GTACTTCAAG 



(2) INFORMATION FOR SEQ ID NO: 605:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 605
TACTTCAAGA 



(2) INFORMATION FOR SEQ ID NO: 606: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 606 
ACTTCAAGAA 



(2) INFORMATION FOR SEQ ID NO: 607: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 607 
CTTCAAGAAC 



(2) INFORMATION FOR SEQ ID NO: 608: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 608 
TTCAAGAACT 



(2) INFORMATION FOR SEQ ID NO: 609: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 609 
TCAAGAACTG



(2) INFORMATION FOR SEQ ID NO: 610: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 610 
CAAGAACTGC 



(2) INFORMATION FOR SEQ ID NO: 611: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 611 
AAGAACTGCT 



(2) INFORMATION FOR SEQ ID NO: 612: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 612: 
AGAACTGCTG 10 



(2) INFORMATION FOR SEQ ID NO: 613: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 613: 
GAACTGCTGA 10 



(2) INFORMATION FOR SEQ ID NO: 614: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1305 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 614: 



GAAGAGATTT GGGAGAACAT GACCTGGATG CAGTGGGAAA AAGAAATTCA CAATCACACA 60

AAATACATAT ACTCCTTACT TGAAAAATCG CAGAACCAAC AAGAAAAGAA TGAACAAGAA 120

CTATTGGAAT TGGATCAATG GGCAAGTTTG TGGAATTGGT TTGACATAAC AAAATGGCTG 180

TGGTATATAA AAATATTCAT AATGGTAGTA GGAGGCTTGA TAGGTTTAAG AATAGTTTTT 240

GCTGTACTTT CTATAGTGAA TAGAGTTAGG CAGGGATACT CACCATTGTC GTTTCAGACC 300

CTCCTCCCAA CCCCGAGGGG ACCCGACAGG CCCGAAGGAA TCGAAGAAGA AGGTGGAGAG 360

AGAGACAGAG ACAGATCCAC TCGATTAGTA CACGGATTCT TAGCACTTTT CTGGGACGAC 420

CTGAGGAGCC TGTGCCTCTT CCTCTACCAC CACTTGAGAG ACTTACTCTT GATTGTAACA 480

AGGATTGTGG AACTTCTGGG ACGCAGGGGA TGGGAAGCCC TCAAATATTG GTGGAACCTC 540

CTAAAGTATT GGAGCCAGGA ACTGCAGAAG AGTGCTGTTA TCTTGCTCAA TGCCACCGCC 600

ATAGCAGTAG CTGAGGGGAC AGATAGAGTT TTAGAAGTAT TACAAAGAGC TTATAGAGCT 660

ATCCTCCACA TACCTAGAAG AATAAGACAG GGCCTCGAAA TGGCTTTGCT ATAAAATGGG 720

TGGCAAGTGA GCAAAAAGTA GTGTAGTCAG ATAGCATGCA TCATAAGGGG TGGGGGCCAA 780

CAACTAACAA TGCTGATCGT GCCTGGCTAG AAGCACAAGA GAAGGAAGAA GCGGGTTTTC 840

CAGTCAAACC TCAGGTAGCT GTAGATCTTA GCCACTTTTT AAAAGAAAAG GGGGGACTGG 900

AAGGGCTAAT TCACTCCCAA AGAAGACAAG ATACACAGTG CTGCAAACTA TTACCAGTGG 960

AGTCAGCGAA GATAGAAGAG GCCAATGGAG GAGAAAACCA CAGATTGTTC TGTTGGGGAC 1020

TTTCCATCCG TTGGGGACTT TCCAAGGCGG CGTGGCCTGG GTGACTAGTT CCGGTGGGGA 1080

CTTTCCAAGA AGGCGCGGCC TGGGCGGGAC TGGGGAGTGG CGAGCCCTCA GATGCTGCAT 1140

ATAAGCAGCT GCTTTCTGCT GTTACTGGGT CTCTCGGGTT AGACCAGATC TGAGCCTGGG 1200

AGCTCTCTGG CTAACTAGGG AACCCACTGC TTAAGCCTCA ATAAAGCTTG CCTTGAGTGC 1260

TTCAAGTAGT GTGTGCCCGT CTGTTGTGTG ACTCTGGTAT CTAGA 1305



(2) INFORMATION FOR SEQ ID NO: 615: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1208 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 615: 

GAAACAATTT GGGATAACAT GACCTGGATG CAGTGGGAAA GAGAAATTGA CAATTACACA 60 

AACATAATAT ACACCTTAAT TGAAGAATCG CAGAACCAAC AAGAAAAAAA TGAACTAGAA 120 

TTATTGGAAT TGGATAAATG GGCAAATTTG TGGAATTGGT TTAGTATATC AAACTGGCTA 180 

TGGTATATAA AATTATTCAT AATGGTAGTA GGAGGCTTGG TAGGTTTAAG AATAGTTTTT 240 

ACTGTACTTT CTATAGTGAA TAGAGTTAGG CAGGGATACT CACCATTGTC GTTTCAGACC 300 

CACCTCCCAA CCCCGAAGGG ACCCGACAGG CCAGAAGGAA TCGAAGAAGA AGGTGGAGAG 360 

AGAGACAGAG GCAGCTCCAC TCGATTAGTG CACGGATTCT TAGCACTTTT CTGGGACGAC 420 

CTGAGGAGTC TGTGCCTCTT CAGCTACCAC CACTTGAGAG ACTTACTCTT GATTGTAACG 480 

AGGATTGTGG AACTTCTGGG ACGCAGGGGA TGGGAAGCCC TCAAATACTG GTGGAATCTC 540 
CTGCAGTATT GGAGGCAGGA ACTACAGAAG AGTGCTGTTA GCTTGTTCAA TGGCACGGCC 600
ATAGCAGTAG CTGAGGGGAC AGATAGAGTT ATAGAAGCTT TACGAAGGGC TTATAGAGCT 660 
ATTCTCCACA TACCTAGAAG AATAAGACAG GGCTTAGAAA GGGCTTTGCT ATAAAATGGG 720 
TGGCAAGTGG TCAGAAAGTA GTGTGGTTAG AAGGCATGTA CCTTTAAGAC AAGGCAGCTA 780 

TAGATCTTAG CCGCTTTTTA AAAGAAAAGG GGGGACTGGA AGGGCTAATT CACTCACAGA 840 

GAAGATCAGT TGAACCAGAA GAAGATAGAA GAGGCCATGA AGAAGAAAAC AACAGATTGT 900

TCCGTTTGTT CCGTTGGGGA CTTTCCAGGA GACGTGGCCT GAGTGATAAG CCGCTGGGGA 960 

CTTTCCGAAG AGGCGTGACG GGACTTTCCA AGGCGACGTG GCCTGGGCGG GACTGGGGAG 1020 

TGGCGAGCCC TCAGATGCTG CATATAAGCA GCTGCTTTCT GCCTGTACTG GGTCTCTCTG 1080 

GTTAGACCAG ATCTGAGCCT GGGAGCTCTC TGGCTAACTA GGGAACCCAC TGCTTAAGCC 1140 

TCAATAAAGC TTGCCTTGAG TGCTTCAAGT AGTGTGTGCC CGTCTGTTGT GTGACTCTGG 1200 

TATCTAGA 1208 



(2) INFORMATION FOR SEQ ID NO: 616: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 616: 
TGGAAGGGCT AATTTGGT 18 



(2) INFORMATION FOR SEQ ID NO: 617:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 617: 
ATCTTCCCTA AAAAATTAGC CTGTC 25 



(2) INFORMATION FOR SEQ ID NO: 618: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 618 
AGGCTCAGAT CTGGTCTAAC 



(2) INFORMATION FOR SEQ ID NO: 619: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 619 
AGCAGCAGGA AGCACTATGG 



(2) INFORMATION FOR SEQ ID NO: 620: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 620 
TGCTAGAGAT TTTCCACAC 



(2) INFORMATION FOR SEQ ID NO: 621: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 621 
AGTGAATAGA GTTAGGCAGG 
(2) INFORMATION FOR SEQ ID NO: 622:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 622
GTAAGACAGT ATGATCAGAT A 



(2) INFORMATION FOR SEQ ID NO: 623:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 623 
TTGTAGGGAA TTCCAAATTC C 



(2) INFORMATION FOR SEQ ID NO: 624: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 624 
CAGGATCCTA CACCTGTCAA CATAAT 



(2) INFORMATION FOR SEQ ID NO: 625: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 625
GGGAATTCCT TATTCCTGCT TG 



(2) INFORMATION FOR SEQ ID NO: 626: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 626 
CCAGAAGTTC CACAATCC 



(2) INFORMATION FOR SEQ ID NO: 627: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 627 
TTCTTCTAGG TATGTGGAG 



(2) INFORMATION FOR SEQ ID NO: 628: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 628 
AGTGAATTAG CCCTTCCAG 



(2) INFORMATION FOR SEQ ID NO: 629: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 629
TGCTAGAGAT TTTCCACAC 



(2) INFORMATION FOR SEQ ID NO: 630: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 630 
TGCTCTGGAA AACTCAT 



(2) INFORMATION FOR SEQ ID NO: 631: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 631 
CTTTCTATAG TGAATAGAG 



(2) INFORMATION FOR SEQ ID NO: 632: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 632 
TATTGGAGTC AGGAACT 



(2) INFORMATION FOR SEQ ID NO: 633: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 633: 
GGTCTAACCA GAGAGAC 



(2) INFORMATION FOR SEQ ID NO: 634: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 634: 

Ala Val Arg Glu Arg Met Arg Arg Ala Glu Pro Ala Ala 
1 5 10



(2) INFORMATION FOR SEQ ID NO: 635: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 635: 

Pro Thr Ser Gln Ser Arg Gly Asp Pro Thr Gly Pro Lys Glu Lys
1 5 10 15

Lys Lys Val Glu Arg Glu Thr Glu Thr Asp Pro Phe Asp Thr Asp
20 25 30 

Pro His 



(2) INFORMATION FOR SEQ ID NO: 636: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 636:

Pro Ser Ser Gln Pro Arg Gly Asp Pro Thr Gly Pro Lys Glu Ser Lys
1 5 10 15

Lys Lys Val Glu Arg Glu Thr Glu Thr Asp Pro Leu Asp Tyr Thr Asp 
20 25 30 

Ser His 



(2) INFORMATION FOR SEQ ID NO: 637: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 637: 

Pro Thr Ser Gln Pro Arg Arg Asp Pro Thr Gly Gln Lys Glu Ser Lys
1 5 10 15

Lys Lys Val Glu Arg Glu Thr Glu Ala Ala Pro Leu Asp Cys Thr Asp 
20 25 30 

Ser His 

(2) INFORMATION FOR SEQ ID NO: 638: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 638: 

Asp Pro Pro Pro Asn Pro Glu Gly Thr Arg Gln Ala Arg Arg Asn Arg
1 5 10 15

Arg Arg Arg Trp Arg Glu Arg Gln Arg Gln Ile His Ser Ile Ser Glu
20 25 30

Arg Ile Leu Ser Thr Tyr Leu Gly Arg Ser Ala Glu Pro Val Pro Leu
35 40 45

Gln Leu Pro Pro Leu Glu Arg Leu Thr Leu Asp Cys Asn Glu Asp Cys
50 55 60

Gly Thr Ser Gly Thr Gln Gly Val Gly Ser Pro Gln Ile Leu Val Glu
65 70 75 80

Ser Pro Thr Val Leu Glu Ser Gly Thr Lys Glu Cys Cys Leu Ala Gln
85 90 95 

Cys His Ser His Ser Ser Ser Gly Asp Arg 
100 105 



(2) INFORMATION FOR SEQ ID NO: 639: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 639: 

Asp Pro Pro Pro Asn Pro Glu Gly Thr Arg Gln Ala Arg Arg Asn Arg
1 5 10 15

Arg Arg Arg Trp Arg Glu Arg Gln Arg Gln Ile His Ser Ile Ser Thr
20 25 30

Arg Ile Leu Ser Thr Phe Leu Gly Arg Pro Glu Glu Pro Val Pro Leu
35 40 45

Pro Leu Pro Pro Leu Glu Arg Leu Thr Leu Asp Cys Asn Lys Asp Cys
50 55 60

Gly Thr Ser Gly Thr Gln Gly Met Gly Ser Pro Gln Ile Leu Val Glu
65 70 75 80

Pro Pro Lys Val Leu Glu Pro Gly Thr Ala Glu Glu Cys Cys Tyr Leu
85 90 95

Ala Gln Cys His Arg His Ser Ser Ser Gly Asp Arg
100 105 



(2) INFORMATION FOR SEQ ID NO: 640: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 640: 

Asp Pro Pro Pro Asn Pro Glu Gly Thr Arg Gln Ala Arg Arg Asn Arg
1 5 10 15

Arg Arg Arg Trp Arg Glu Arg Gln Arg Gln Leu His Ser Ile Ser Ala
20 25 30

Arg Ile Leu Ser Thr Phe Leu Gly Arg Pro Glu Glu Ser Val Pro Leu
35 40 45

Gln Leu Pro Pro Leu Glu Arg Leu Thr Leu Asp Cys Asn Glu Asp Cys
50 55 60

Gly Thr Ser Gly Thr Gln Gly Met Gly Ser Pro Gln Ile Leu Val Glu
65 70 75 80

Ser Pro Ala Val Leu Glu Ala Gly Thr Thr Glu Glu Cys Cys Leu Val
85 90 95

Gln Trp His Gly His Ser Ser Ser Gly Asp Arg
100 105 



(2) INFORMATION FOR SEQ ID NO: 641:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 641: 

Glu Gln Ile Trp Asn Asn Met Thr Trp Met Glu Trp Asp Arg Glu Ile
1 5 10 15

Asn Asn Tyr Thr Ser Leu Ile His Ser Leu Ile Glu Glu Ser Gln Asn
20 25 30

Gln Gln Glu Lys Asn Glu Gln Glu Leu Leu Glu Leu Asp Lys Trp Ala
35 40 45

Ser Leu Trp Asn Trp Phe Asn Ile Thr Asn Trp Leu Trp Tyr Ile Lys
50 55 60

Leu Phe Ile Met Ile Val Gly Gly Leu Val Gly Leu Arg Ile Val Phe
65 70 75 80

Ala Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu
85 90 95

Ser Phe Gln Thr His Leu Pro Ile Pro Arg Gly Pro Asp Arg Pro Glu
100 105 110

Gly Ile Glu Glu Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser Ile Arg
115 120 125

Leu Val Asn Gly Ser Leu Ala Leu Ile Trp Asp Asp Leu Arg Ser Leu
130 135 140

Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Leu Leu Leu Ile Val Thr
145 150 155 160

Arg Ile Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Ala Leu Lys Tyr
165 170 175

Trp Trp Asn Leu Leu Gln Tyr Trp Ser Gln Glu Leu Lys Asn Ser Ala
180 185 190

Val Asn Leu Leu Asn Ala Thr Ala Ile Ala Val Ala Glu Gly Thr Asp
195 200 205

Arg Val Ile Glu Val Leu Gln Ala Ala Tyr Arg Ala Ile Arg His Ile
210 215 220

Pro Arg Arg Ile Arg Gln Gly Leu Glu Arg Ile Leu Leu
225 230 235



(2) INFORMATION FOR SEQ ID NO: 642:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 642: 

Glu Glu Ile Trp Glu Asn Met Thr Trp Met Gln Trp Glu Lys Glu Ile
1 5 10 15

His Asn His Thr Lys Tyr Ile Tyr Ser Leu Leu Glu Lys Ser Gln Asn
20 25 30

Gln Gln Glu Lys Asn Glu Gln Glu Leu Leu Glu Leu Asp Gln Trp Ala
35 40 45

Ser Leu Trp Asn Trp Phe Asp Ile Thr Lys Trp Leu Trp Tyr Ile Lys
50 55 60

Ile Phe Ile Met Val Val Gly Gly Leu Ile Gly Leu Arg Ile Val Phe
65 70 75 80

Ala Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu
85 90 95

Ser Phe Gln Thr Leu Leu Pro Thr Pro Arg Gly Pro Asp Arg Pro Glu
100 105 110

Gly Ile Glu Glu Met Gly Gly Glu Arg Asp Arg Asp Arg Ser Thr Arg
115 120 125

Leu Val His Gly Phe Leu Ala Leu Phe Trp Asp Asp Leu Arg Ser Leu
130 135 140

Cys Leu Phe Leu Tyr His His Leu Arg Asp Leu Leu Leu Ile Val Thr
145 150 155 160

Arg Ile Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Ala Leu Lys Tyr
165 170 175

Trp Trp Asn Leu Leu Lys Tyr Trp Ser Gln Glu Leu Gln Lys Ser Ala
180 185 190

Val Ile Leu Leu Asn Ala Thr Ala Ile Ala Val Ala Glu Gly Thr Asp
195 200 205

Arg Val Leu Glu Val Leu Gln Arg Ala Tyr Arg Ala Ile Leu His Ile
210 215 220

Pro Arg Arg Ile Arg Gln Gly Leu Glu Met Ala Leu Leu
225 230 235 



(2) INFORMATION FOR SEQ ID NO: 643:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 643: 

Glu Thr Ile Trp Asp Asn Met Thr Trp Met Gln Trp Glu Arg Glu Ile
1 5 10 15

Asp Asn Tyr Thr Asn Ile Ile Tyr Thr Leu Ile Glu Glu Ser Gln Asn
20 25 30

Gln Gln Glu Lys Asn Glu Leu Glu Leu Leu Glu Leu Asp Lys Trp Ala
35 40 45

Asn Leu Trp Asn Trp Phe Ser Ile Ser Asn Trp Leu Trp Tyr Ile Lys
50 55 60

Leu Phe Ile Met Val Val Gly Gly Leu Val Gly Leu Arg Ile Val Phe
65 70 75 80

Thr Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu
85 90 95

Ser Phe Gln Thr His Leu Pro Thr Pro Lys Gly Pro Asp Arg Pro Glu
100 105 110

Gly Ile Glu Glu Glu Gly Gly Glu Arg Asp Arg Gly Ser Ser Thr Arg
115 120 125

Leu Val His Gly Phe Leu Ala Leu Phe Trp Asp Asp Leu Arg Ser Leu
130 135 140

Cys Leu Phe Ser Tyr His His Leu Arg Asp Leu Leu Leu Ile Val Thr
145 150 155 160

Arg Ile Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Ala Leu Lys Tyr
165 170 175

Trp Trp Asn Leu Leu Gln Tyr Trp Arg Gln Glu Leu Gln Lys Ser Ala
180 185 190

Val Ser Leu Phe Asn Gly Thr Ala Ile Ala Val Ala Glu Gly Thr Asp
195 200 205

Arg Val Ile Glu Ala Leu Arg Arg Ala Tyr Arg Ala Ile Leu His Ile
210 215 220

Pro Arg Arg Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu
225 230 235 



(2) INFORMATION FOR SEQ ID NO: 644: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 206 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 644: 

Met Gly Gly Lys Trp Ser Lys Ser Ser Val Ile Gly Trp Pro Ala Val
1 5 10 15

Arg Glu Arg Met Arg Arg Ala Glu Pro Ala Ala Asp Gly Val Gly Ala
20 25 30

Val Ser Arg Asp Leu Glu Lys His Gly Ala Ile Thr Ser Ser Asn Thr
35 40 45

Ala Ala Asn Asn Ala Ala Cys Ala Trp Leu Glu Ala Gln Glu Glu Glu
50 55 60

Glu Val Gly Phe Pro Val Thr Pro Gln Val Pro Leu Arg Pro Met Thr
65 70 75 80

Tyr Lys Ala Ala Val Asp Leu Ser His Phe Leu Lys Glu Lys Gly Gly
85 90 95

Leu Glu Gly Leu Ile His Ser Gln Arg Arg Gln Asp Ile Leu Asp Leu
100 105 110

Trp Ile Tyr His Thr Gln Gly Tyr Phe Pro Asp Trp Gln Asn Tyr Thr
115 120 125

Pro Gly Pro Gly Val Arg Tyr Pro Leu Thr Phe Gly Trp Cys Tyr Lys
130 135 140

Leu Val Pro Val Glu Pro Asp Lys Val Glu Glu Ala Asn Lys Gly Glu
145 150 155 160

Asn Thr Ser Leu Leu His Pro Val Ser Leu His Gly Met Asp Asp Pro
165 170 175 

Glu Arg Glu Val Leu Glu Trp Arg Phe Asp Ser Arg Leu Ala Phe His 
180 185 190 

His Val Ala Arg Glu Leu His Pro Glu Tyr Phe Lys Asn Cys 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 645: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 645: 

Met Gly Gly Lys Ala Lys Ser Ser Val Val Arg His Ala Ser Gly 
1 5 10 15 



Val Gly Ala Asn Asn Gly Cys 
20 



(2) INFORMATION FOR SEQ ID NO: 646: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 646:

Met Gly Gly Lys Trp Ser Glu Ser Ser Val Val Arg Arg His Val Pro
1 5 10 15

Leu Arg Gln Gly Ser Tyr Arg Ser Pro Leu
20 25 



(2) INFORMATION FOR SEQ ID NO: 647: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 647: 



CTACAAGGGA CTTTCCGCTG GGGACTTTCC AGGGAGGCGT GGCCTGGGCG GGACTGGGGA 
GTGGCGAGCC CTCA 



(2) INFORMATION FOR SEQ ID NO: 648: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 78 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 648: 



CTGTTGGGGA CTTTCCATCC GTTGGGGACT TTCCAAGGCG GCGTGGCCTG GGTGACTAGT 
TCCGGTGGGG ACTTTCCA 



(2) INFORMATION FOR SEQ ID NO: 649: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 649: 



CCGTTTGTTC CGTTGGGGAC TTTCCAGGAG ACGTGGCCTG AGTGACTAAG CCGCTGGGGA 
CTTTCCG 



(2) INFORMATION FOR SEQ ID NO: 650: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 621 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 650: 
ATGGGTGGCA AGTGGTCAAA AAGTAGTGTG ATTGGATGGC CTGCTGTAAG GGAAAGAATG 60

AGACGAGCTG AGCCAGCAGC AGATGGGGTG GGAGCAGTAT CTCGAGACCT AGAAAAACAT 120

GGAGCAATCA CAAGTAGCAA TACAGCAGCT AACAATGCTG CTTGTGCCTG GCTAGAAGCA 180 

CAAGAGGAGG AAGAGGTGGG TTTTCCAGTC ACACCTCAGG TACCTTTAAG ACCAATGACT 240 

TACAAGGCAG CTGTAGATCT TAGCCACTTT TTAAAAGAAA AGGGGGGACT GGAAGGGCTA 300 

ATTCACTCCC AAAGAAGACA AGATATCCTT GATCTGTGGA TCTACCACAC ACAAGGCTAC 360 

TTCCCTGATT GGCAGAACTA CACACCAGGG CCAGGGGTCA GATATCCACT GACCTTTGGA 420 

TGGTGCTACA AGCTAGTACC AGTTGAGCCA GATAAGGTAG AAGAGGCCAA TAAAGGAGAG 480 

AACACCAGCT TGTTACACCC TGTGAGCCTG CATGGAATGG ATGACCCTGA GAGAGAAGTG 540 

TTAGAGTGGA GGTTTGACAG CCGCCTAGCA TTTCATCACG TGGCCCGAGA GCTGCATCCG 600 

GAGTACTTCA AGAACTGCTG A 621 

(2) INFORMATION FOR SEQ ID NO: 651: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1596 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 651: 



GAACAGATTT GGAATAACAT GACCTGGATG GAGTGGGACA GAGAAATTAA CAATTACACA 60

AGCTTAATAC ACTCCTTAAT TGAAGAATCG CAAAACCAGC AAGAAAAGAA TGAACAAGAA 120

TTATTGGAAT TAGATAAATG GGCAAGTTTG TGGAATTGGT TTAACATAAC AAATTGGCTG 180

TGGTATATAA AATTATTCAT AATGATAGTA GGAGGCTTGG TAGGTTTAAG AATAGTTTTT 240

GCTGTACTTT CTATAGTGAA TAGAGTTAGG CAGGGATATT CACCATTATC GTTTCAGACC 300

CACCTCCCAA TCCCGAGGGG ACCCGACAGG CCCGAAGGAA TAGAAGAAGA AGGTGGAGAG 360

AGAGACAGAG ACAGATCCAT TCGATTAGTG AACGGATCCT TAGCACTTAT CTGGGACGAT 420

CTGCGGAGCC TGTGCCTCTT CAGCTACCAC CGCTTGAGAG ACTTACTCTT GATTGTAACG 480

AGGATTGTGG AACTTCTGGG ACGCAGGGGG TGGGAAGCCC TCAAATATTG GTGGAATCTC 540

CTACAGTATT GGAGTCAGGA ACTAAAGAAT AGTGCTGTTA ACTTGCTCAA TGCCACAGCC 600

ATAGCAGTAG CTGAGGGGAC AGATAGGGTT ATAGAAGTAT TACAAGCAGC TTATAGAGCT 660

ATTCGCCACA TACCTAGAAG AATAAGACAG GGCTTGGAAA GGATTTTGCT ATAAGATGGG 720

TGGCAAGTGG TCAAAAAGTA GTGTGATTGG ATGGCCTGCT GTAAGGGAAA GAATGAGACG 780

AGCTGAGCCA GCAGCAGATG GGGTGGGAGC AGTATCTCGA GACCTAGAAA AACATGGAGC 840

AATCACAAGT AGCAATACAG CAGCTAACAA TGCTGCTTGT GCCTGGCTAG AAGCACAAGA 900

GGAGGAAGAG GTGGGTTTTC CAGTCACACC TCAGGTACCT TTAAGACCAA TGACTTACAA 960

GGCAGCTGTA GATCTTAGCC ACTTTTTAAA AGAAAAGGGG GGACTGGAAG GGCTAATTCA 1020

CTCCCAAAGA AGACAAGATA TCCTTGATCT GTGGATCTAC CACACACAAG GCTACTTCCC 1080

TGATTGGCAG AACTACACAC CAGGGCCAGG GGTCAGATAT CCACTGACCT TTGGATGGTG 1140

CTACAAGCTA GTACCAGTTG AGCCAGATAA GGTAGAAGAG [illegible] [illegible] 1200

CAGCTTGTTA CACCCTGTGA GCCTGCATGG AATGGATGAC CCTGAGAGAG AAGTGTTAGA 1260

GTGGAGGTTT GACAGCCGCC TAGCATTTCA TCACGTGGCC CGAGAGCTGC ATCCGGAGTA 1320

CTTCAAGAAC TGCTGACATC GAGCTTGCTA CAAGGGACTT TCCGCTGGGG ACTTTCCAGG 1380

GAGGCGTGGC CTGGGCGGGA CTGGGGAGTG GCGAGCCCTC AGATGCTGCA TATAAGCAGC 1440

TGCTTTTTGC CTGTACTGGG TCTCTCTGGT TAGACCAGAT CTGAGCCTGG GAGCTCTCTG 1500

GCTAACTAGG GAACCCACTG CTTAAGCCTC AATAAAGCTT GCCTTGAGTG CTTCAAGTAG 1560

TGTGTGCCCG TCTGTTGTGT GACTCTGGTA ACTAGA 1596



(2) INFORMATION FOR SEQ ID NO: 652: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 652: 
GCTTTTTGCC 10 



(2) INFORMATION FOR SEQ ID NO: 653: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 653:
CTTTTTGCCT 10



(2) INFORMATION FOR SEQ ID NO: 654:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 654 
TTTTTGCCTG 



(2) INFORMATION FOR SEQ ID NO: 655: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 655 
TTTTGCCTGT 



(2) INFORMATION FOR SEQ ID NO: 656: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 656 
TTTGCCTGTA 



(2) INFORMATION FOR SEQ ID NO: 657: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 657
TTGCCTGTAC



(2) INFORMATION FOR SEQ ID NO: 658: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 658 
TGCCTGTACT 



(2) INFORMATION FOR SEQ ID NO: 659: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 659 
GCCTGTACTG 



(2) INFORMATION FOR SEQ ID NO: 660: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 660 
CCTGTACTGG 



(2) INFORMATION FOR SEQ ID NO: 661: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 661 



CTGTACTGGG



(2) INFORMATION FOR SEQ ID NO: 662:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 662 
TGTACTGGGT 



(2) INFORMATION FOR SEQ ID NO: 663: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 663 
GTACTGGGTC 



(2) INFORMATION FOR SEQ ID NO: 664: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 664 
TACTGGGTCT 



(2) INFORMATION FOR SEQ ID NO: 665: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 665 



ACTGGGTCTC 



(2) INFORMATION FOR SEQ ID NO: 666: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 666
CTGGGTCTCT 



(2) INFORMATION FOR SEQ ID NO: 667:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 667 
TGGGTCTCTC 



(2) INFORMATION FOR SEQ ID NO: 668: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 668 
GGGTCTCTCT 



(2) INFORMATION FOR SEQ ID NO: 669: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 669 



GGTCTCTCTG



(2) INFORMATION FOR SEQ ID NO: 670:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 670 
GTCTCTCTGG 



(2) INFORMATION FOR SEQ ID NO: 671: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 671 
TCTCTCTGGT 



(2) INFORMATION FOR SEQ ID NO: 672: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 672 
CTCTCTGGTT 



(2) INFORMATION FOR SEQ ID NO: 673: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 673 



TCTCTGGTTA 



(2) INFORMATION FOR SEQ ID NO: 674: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 674 
CTCTGGTTAG 



(2) INFORMATION FOR SEQ ID NO: 675: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 675 
TCTGGTTAGA



(2) INFORMATION FOR SEQ ID NO: 676: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 676 
CTGGTTAGAC 



(2) INFORMATION FOR SEQ ID NO: 677: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 677 



TGGTTAGACC 



(2) INFORMATION FOR SEQ ID NO: 678: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 678 
GGTTAGACCA 



(2) INFORMATION FOR SEQ ID NO: 679: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 679 
GTTAGACCAG 



(2) INFORMATION FOR SEQ ID NO: 680: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 680 
TTAGACCAGA 



(2) INFORMATION FOR SEQ ID NO: 681: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 681 



TAGACCAGAT



(2) INFORMATION FOR SEQ ID NO: 682:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 682 
AGACCAGATC 



(2) INFORMATION FOR SEQ ID NO: 683: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 683 
GACCAGATCT 



(2) INFORMATION FOR SEQ ID NO:684: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 684 
ACCAGATCTG 



(2) INFORMATION FOR SEQ ID NO: 685: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 685 



CCAGATCTGA



(2) INFORMATION FOR SEQ ID NO: 686:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 686 
CAGATCTGAG 



(2) INFORMATION FOR SEQ ID NO: 687: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 687 
AGATCTGAGC 



(2) INFORMATION FOR SEQ ID NO: 688: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 688 
GATCTGAGCC 



(2) INFORMATION FOR SEQ ID NO: 689: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 689 



ATCTGAGCCT



(2) INFORMATION FOR SEQ ID NO: 690:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 690 
TCTGAGCCTG 



(2) INFORMATION FOR SEQ ID NO: 691: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 691 
CTGAGCCTGG 



(2) INFORMATION FOR SEQ ID NO: 692: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 692 
TGAGCCTGGG 



(2) INFORMATION FOR SEQ ID NO: 693: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 693 



GAGCCTGGGA



(2) INFORMATION FOR SEQ ID NO: 694:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 694 
AGCCTGGGAG 



(2) INFORMATION FOR SEQ ID NO: 695: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 695 
GCCTGGGAGC 



(2) INFORMATION FOR SEQ ID NO: 696: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 696 
CCTGGGAGCT 



(2) INFORMATION FOR SEQ ID NO: 697: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 697 
CTGGGAGCTC



(2) INFORMATION FOR SEQ ID NO: 698:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 698 
TGGGAGCTCT 



(2) INFORMATION FOR SEQ ID NO: 699: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 699 
GGGAGCTCTC 



(2) INFORMATION FOR SEQ ID NO: 700: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 700 
GGAGCTCTCT 



(2) INFORMATION FOR SEQ ID NO: 701: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 701 



GAGCTCTCTG



(2) INFORMATION FOR SEQ ID NO: 702:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:702 
AGCTCTCTGG 



(2) INFORMATION FOR SEQ ID NO: 703: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 703 
GCTCTCTGGC 



(2) INFORMATION FOR SEQ ID NO: 704: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 704 
CTCTCTGGCT 



(2) INFORMATION FOR SEQ ID NO: 705: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:705 

TCTCTGGCTA



(2) INFORMATION FOR SEQ ID NO: 706:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 706 
CTCTGGCTAA 



(2) INFORMATION FOR SEQ ID NO: 707: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 707 
TCTGGCTAAC 



(2) INFORMATION FOR SEQ ID NO: 708: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 708 
CTGGCTAACT 



(2) INFORMATION FOR SEQ ID NO: 709:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 709 



TGGCTAACTA



(2) INFORMATION FOR SEQ ID NO: 710:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 710 
GGCTAACTAG 



(2) INFORMATION FOR SEQ ID NO: 711: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 711 
GCTAACTAGG 



(2) INFORMATION FOR SEQ ID NO: 712: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 712 
CTAACTAGGG 



(2) INFORMATION FOR SEQ ID NO: 713: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 713 



TAACTAGGGA



(2) INFORMATION FOR SEQ ID NO: 714:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 714 
AACTAGGGAA 



(2) INFORMATION FOR SEQ ID NO: 715: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 715 
ACTAGGGAAC 



(2) INFORMATION FOR SEQ ID NO: 716: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 716 
CTAGGGAACC 



(2) INFORMATION FOR SEQ ID NO: 717:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 717 
TAGGGAACCC



(2) INFORMATION FOR SEQ ID NO: 718:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 718 
AGGGAACCCA 



(2) INFORMATION FOR SEQ ID NO: 719: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 719 
GGGAACCCAC 



(2) INFORMATION FOR SEQ ID NO: 720: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 720
GGAACCCACT 



(2) INFORMATION FOR SEQ ID NO: 721: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 721 
GAACCCACTG



(2) INFORMATION FOR SEQ ID NO: 722:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 722 
AACCCACTGC 



(2) INFORMATION FOR SEQ ID NO: 723: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 723 
ACCCACTGCT 



(2) INFORMATION FOR SEQ ID NO: 724: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 724 
CCCACTGCTT 



(2) INFORMATION FOR SEQ ID NO: 725: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 725 



CCACTGCTTA



(2) INFORMATION FOR SEQ ID NO: 726:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 726 
CACTGCTTAA 



(2) INFORMATION FOR SEQ ID NO: 727: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 727 
ACTGCTTAAG 



(2) INFORMATION FOR SEQ ID NO: 728: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 728 
CTGCTTAAGC 



(2) INFORMATION FOR SEQ ID NO: 729:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 729 



TGCTTAAGCC



(2) INFORMATION FOR SEQ ID NO: 730:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 730 
GCTTAAGCCT 



(2) INFORMATION FOR SEQ ID NO: 731: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 731 
CTTAAGCCTC 



(2) INFORMATION FOR SEQ ID NO: 732: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 732 
TTAAGCCTCA 



(2) INFORMATION FOR SEQ ID NO: 733: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 733 



TAAGCCTCAA



(2) INFORMATION FOR SEQ ID NO: 734:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 734 
AAGCCTCAAT 



(2) INFORMATION FOR SEQ ID NO: 735: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 735 
AGCCTCAATA 



(2) INFORMATION FOR SEQ ID NO: 736: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:736 
GCCTCAATAA 



(2) INFORMATION FOR SEQ ID NO: 737: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:737 
CCTCAATAAA



(2) INFORMATION FOR SEQ ID NO: 738:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 738 
CTCAATAAAG 



(2) INFORMATION FOR SEQ ID NO: 739: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 739 
TCAATAAAGC 



(2) INFORMATION FOR SEQ ID NO: 740: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 740 
CAATAAAGCT



(2) INFORMATION FOR SEQ ID NO: 741: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 741 



AATAAAGCTT



(2) INFORMATION FOR SEQ ID NO: 742:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 742 
ATAAAGCTTG 



(2) INFORMATION FOR SEQ ID NO: 743: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 743 
TAAAGCTTGC 



(2) INFORMATION FOR SEQ ID NO: 744: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 744 
AAAGCTTGCC 



(2) INFORMATION FOR SEQ ID NO: 745: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:745 



AAGCTTGCCT



(2) INFORMATION FOR SEQ ID NO: 746:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 746 
AGCTTGCCTT 



(2) INFORMATION FOR SEQ ID NO: 747: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:747 
GCTTGCCTTG



(2) INFORMATION FOR SEQ ID NO: 748: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 748 
CTTGCCTTGA 



(2) INFORMATION FOR SEQ ID NO: 749: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 749 



TTGCCTTGAG



(2) INFORMATION FOR SEQ ID NO: 750:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 750 
TGCCTTGAGT 



(2) INFORMATION FOR SEQ ID NO: 751: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 751 
GCCTTGAGTG 



(2) INFORMATION FOR SEQ ID NO: 752: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 752 
CCTTGAGTGC 



(2) INFORMATION FOR SEQ ID NO: 753: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 753 



CTTGAGTGCT 



(2) INFORMATION FOR SEQ ID NO: 754: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 754 
TTGAGTGCTT 



(2) INFORMATION FOR SEQ ID NO: 755: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 755 
TGAGTGCTTC 



(2) INFORMATION FOR SEQ ID NO: 756: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 756 
GAGTGCTTCA 



(2) INFORMATION FOR SEQ ID NO: 757: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 757 



AGTGCTTCAA



(2) INFORMATION FOR SEQ ID NO: 758:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 758 
GTGCTTCAAG 



(2) INFORMATION FOR SEQ ID NO: 759: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 759 
TGCTTCAAGT 



(2) INFORMATION FOR SEQ ID NO: 760: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 760 
GCTTCAAGTA 



(2) INFORMATION FOR SEQ ID NO: 761: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 761 



CTTCAAGTAG



(2) INFORMATION FOR SEQ ID NO: 762:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 762 
TTCAAGTAGT 



(2) INFORMATION FOR SEQ ID NO: 763: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 763 
TCAAGTAGTG 



(2) INFORMATION FOR SEQ ID NO: 764: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 764 
CAAGTAGTGT 



(2) INFORMATION FOR SEQ ID NO: 765: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 765 



AAGTAGTGTG



(2) INFORMATION FOR SEQ ID NO: 766:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 766 
AGTAGTGTGT 



(2) INFORMATION FOR SEQ ID NO: 767: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 767 
GTAGTGTGTG 



(2) INFORMATION FOR SEQ ID NO: 768: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 768 
TAGTGTGTGC 



(2) INFORMATION FOR SEQ ID NO: 769: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 769 



AGTGTGTGCC



(2) INFORMATION FOR SEQ ID NO: 770:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 770 
GTGTGTGCCC 



(2) INFORMATION FOR SEQ ID NO: 771: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 771 
TGTGTGCCCG 



(2) INFORMATION FOR SEQ ID NO: 772: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 772 
GTGTGCCCGT



(2) INFORMATION FOR SEQ ID NO: 773: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 773 



TGTGCCCGTC



(2) INFORMATION FOR SEQ ID NO: 774:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 774 
GTGCCCGTCT 



(2) INFORMATION FOR SEQ ID NO: 775: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 775 
TGCCCGTCTG 



(2) INFORMATION FOR SEQ ID NO: 776: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:776 
GCCCGTCTGT 



(2) INFORMATION FOR SEQ ID NO: 777: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 777 



CCCGTCTGTT



(2) INFORMATION FOR SEQ ID NO: 778:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 778 
CCGTCTGTTG 



(2) INFORMATION FOR SEQ ID NO: 779: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 779 
CGTCTGTTGT 



(2) INFORMATION FOR SEQ ID NO: 780: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 780 
GTCTGTTGTG 



(2) INFORMATION FOR SEQ ID NO: 781: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 781 



TCTGTTGTGT 
(2) INFORMATION FOR SEQ ID NO: 782:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 782 
CTGTTGTGTG 



(2) INFORMATION FOR SEQ ID NO: 783: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 783 
TGTTGTGTGA 



(2) INFORMATION FOR SEQ ID NO: 784: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 784 
GTTGTGTGAC 



(2) INFORMATION FOR SEQ ID NO: 785: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 785 



TTGTGTGACT 
(2) INFORMATION FOR SEQ ID NO: 786:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 786 
TGTGTGACTC 



(2) INFORMATION FOR SEQ ID NO: 787: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 787 
GTGTGACTCT 



(2) INFORMATION FOR SEQ ID NO: 788:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 788 
TGTGTGACTC 



(2) INFORMATION FOR SEQ ID NO: 789:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 789 



GTGTGACTCT 
(2) INFORMATION FOR SEQ ID NO: 790:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 790 
TGTGACTCTG 



(2) INFORMATION FOR SEQ ID NO: 791: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 791 
GTGACTCTGG 



(2) INFORMATION FOR SEQ ID NO: 792: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 792 
TGACTCTGGT 



(2) INFORMATION FOR SEQ ID NO: 793: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 793



GACTCTGGTA 
(2) INFORMATION FOR SEQ ID NO: 794:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 794 
ACTCTGGTAA 



(2) INFORMATION FOR SEQ ID NO: 795: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 795 
CTCTGGTAAC 

(2) INFORMATION FOR SEQ ID NO: 796: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 796 
TCTGGTAACT 



(2) INFORMATION FOR SEQ ID NO: 797: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 797 
CTGGTAACTA 
(2) INFORMATION FOR SEQ ID NO: 798:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 798: 
TGGTAACTAG 



(2) INFORMATION FOR SEQ ID NO: 799: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 799: 
GGTAACTAGA 



(2) INFORMATION FOR SEQ ID NO: 800: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 800: 



TGGAAGGGCT AATTCACTCA CGGAAAAGAC CAGTTGAACC AGAAGAAGAT AGAAGAGGCC  60
ATGAAGAAGA AAACAACAGA TTGTTCTGCT TGCTCAGCTG GGGACTTTCC AGAAGGCGCG  120
GCCTGAGTGA CTAAGCCCCG TTGGGGACTT TCCGAAGAGG CATGAAGGGA CTTTCCAAGG  180
CAGGCGTGGC CTGGGCGGGA CTGGGGAGTG GCGAGCCCTC AGATGCTGCA TATAAGCAGC  240
TGCTTTCTGC CTGTACTGGG TCTCTCTGGT TAGACCAGAT CTGAGCCTGG GAGCTCTCTG  300
GCTAGCTAGG GAACCCACTG CTTAAGCCTC AATAAAGCTT GCCTTGAGTG CTTCAAGTAG  360
TGTGTGCCCG TCTGTTGTGT GACTCTGGTA TCTAGAGATC CCTCAGACCA TTTTAGTCCG  420
TGTGGAAAAT CTCTAGCAGT GGCGCCCGAA CAGGGACTTG AAAGCGAAAG GAAAACCAGA  480
GGAGCTCTCT CGACGCAGGA CTCGGCTTGC TGAAGCGCGC ACGGCAAGAG GCGAGGGGCG  540
GCGACTGGTG AGTACGCCGA AAATTTTGAC TAGCGGAGGC TAGAAGGAGA GAGATGGGTG  600
CGAGAGCGTC AATATTAAGC GGGGGAAAAT TAGATAGATG GGAGAAAATT CGGTTAAGGC  660
CAGGAGGAAA GAAAAAGTAT AAATTAAAAC ATATAGTATG GGCAAGCAGG GAGCTAGAAC  720
GATTCGCAGT CAATCCTGGC CTGTTGGAAA CATCAGAAGG CTGTAGACAA ATACTGGGAC  780
AGTTACACCC GTCCCTTCAG ACAGGATCAG AAGAACTTAA ATCAGTATAT AATGCAGTAG  840
CAGTCCTCTA TTGTGTGCAT CAAAACATAG ACATAAAGGA CACCAAGGAA GCTTTAGAAA  900
AGATAGAGGA AGAGCAAAAC AAATGTAAGA AAAAAGCACA GCAAGCAGCA GCACAGCAAG  960
CAGCAGCTGG CACAGGAAAC AGCAACCCGG TCAGCCAAAA TTACCCTATA GTACAGAACA  1020
TGCAGGGGCA AATGGTACAT CAGGCCATAT CACCTAGAAC TTTAAATGCA TGGGTAAAAG  1080
TAATAGAAGA GAAGGCTTTC AGCCCAGAGG TAATACCCAT GTTTTCAGCA TTATCAGAAG  1140
GAGCCACCCC ACAAGATTTA AACACCATGC TAAACACAGT GGGGGGACAT CAAGCAGCTA  1200
TGCAAATGTT AAAAGAGACC ATCAATGAGG AAGCTGCAGA ATGGGATAGA TTACATCCAG  1260
CGCAGGCAGG GCCTGTTGCA CCAGGCCAGA TGAGAGACCC AAGGGGAAGT GACATAGCAG  1320
GAACTACTAG TACCCTTCAG GAACAAATAG GATGGATGAC AGGTAATCCA GCTATCCCAG  1380
TAGGAGAAAT CTATAAAAGA TGGATAATCC TGGGATTAAA TAAAATAGTA AGGATGTATA  1440
GCCCTATCAG CATTCTGGAC ATAAAACAAG GACCAAAGGA ACCCTTTAGA GACTATGTAG  1500
ACCGGTTCTA TAAAACTCTA AGAGCCGAGC AAGCTACACA GGAGGTAAAA AATTGGATGA  1560
CAGAAACCTT GTTGGTCCAA AATGCAAACC CAGATTGTAA GACTATTTTA AAAGCATTGG  1620
GACCAGCAGC TACACTAGAA GAAATGATGA CAGCATGTCA GGGAGTGGGA GGACCCAGCC  1680
ATAAAGCAAG AGTTTTGGCA GAAGCAATGA GCCAAGCAAC AAATGCAGCT ACTGTAATGA  1740
TGCAGAGAAG CAATTTTAGA AACCAAAGAA AGAATGTTAA GTGTTTCAAT TGTGGCAAAG  1800
AAGGGCACAT [illegible] [illegible] [illegible] [illegible] AAATGTGGAA  1860
AGGAAGGACA CCAAATGAAA GATTGTACTG AGAGACAGGC TAATTTTTTA GGGAAAATCT  1920
GGCCTTCCCA CAAGGGGAGG CCAGGGAACT TTCTTCAGAG CAGGCCAGAA CCAACAGCCC  1980
CTCTCCAGGG CAGGCCGGAG CCATCAGCCC CGCCAGAAGA GAGCTTCAGG TTTGGGGAGG  2040
AGACAACAAC TCCCTCTCAG AAGCAGGAGC CGATAGACAG GGACAGGGAT CTGTATCCTT  2100
TAGCTTCCCT CAGATCACTC TTTGGCAACG ACCCCTCGTC ACAATAAAGA TAGGGGGGCA  2160
GCTGAAGGAA GCTCTATTAG ATACAGGAGC AGATGATACA GTATTAGAAG ACATGCATTT  2220
GCCAGGAAAA TGGAAACCAA AAATGATAGG GGGAATTGGA GGTTTTATCA AAGTAAAACA  2280
ATATGATGAA ATTCTTGTAG AAATCTGTGG ACATAAAGCT ATAGGTACAG TATTAGTAGG  2340
ACCTACACCT GTCAACATAA TTGGAAGAAA TCTGTTGACT CAGATTGGTT GCACTTTAAA  2400
TTTTCCCATT AGTCCTATTG AAACTGTACC AGTACAATTA AAGCCAGGAA TGGATGGCCC  2460
AAAGGTTAAA CAATGGCCAT TGACAGAAGA GAAAATAAAA GCATTAGTAG AAATTTGTAC  2520
AGAAATGGAA AAGGAAGGAA AGATTTCAAA AATTGGGCCT GAAAATCCAT ACAATACTCC  2580
AGTATTTGCC ATAAAGAAAA AAGATGGTAC TAAATGGAGA AAATTAGTAG ATTTCAGAGA  2640
CCTTAATAAG AGAACTCAAG ACTTCTGGGA AGTTCAATTA GGAATACCAC ATCCCTCAGG  2700
ATTAAAAAAG AAAAAATCAG TAACAGTACT GGATGTGGGT GATGCATACT TTTCAGTTCC  2760
CTTAGATGAA AACTTCAGGA AGTATACTGC ATTTACCATA CCTAGTATAA ATAATGAGAC  2820
ACCAGGGATT AGATATCAGT ACAATGTGCT TCCACAGGGA TGGAAAGGAT CACCAGCAAT  2880
ATTCCAAAGT AGCATGACAA GAATCTTAGA GCCTTTTAGA AGACAAAATC CAGACATAGT  2940
TATCTATCAA TACATGGATG ACTTGTATGT AGGATCTGAT TTAGAAATAG GACAGCATAG  3000
AATAAAAATA GAGGAACTGA GACAACATCT GTTGAAGTGG GGATTTACCA CACCAGACAA  3060
AAAGCATCAG AAAGAACCCC CATTCCTTTG GATGGGTTAT GAACTCCATC CTGATAAATG  3120
GACAGTGCAA CCTATAGTAC TGCCAGAAAA AGACAGCTGG ACTGTCAATG ACATACAGAA  3180
GTTAGTGGGT AAATTAAATT GGGCAAGTCA GATTTACCCA GGAATTAAAG TAAGGCAATT  3240
ATGTAAACTC CTTAGGGGAA CCAAAGCACT AACAGAAGTA ATACCACTAA CAGAAGAAGC  3300
AGAGCTAGAA CTGGCAGAAA ACAGGGAAAT TCTAAGAGAA CCAGTACATG GAGTGTATTA  3360
TGACCCATCA AAAGACTTAA TAGCAGAAAT ACAGAAGCAG GAGCAAGGCC AATGGACATA  3420
TCAAATTTAT CAAGATCAAT TTAAAAATCT AAAAACAGGA AAGTATGCAA GATTGAGGGG  3480
TGCCCACACT AATGATGTAA AACAATTTCC AGAGGCAGTG CAAAAAATAG CCACAGAAAG  3540
CATAGTAATA TGGGGAAAGA CTCCTAAATT TAGACTACCC ATACAAAAAG AAACATGGGA  3600
CGCATGGTGG ACAGAGTATT [illegible] [illegible] [illegible] [illegible]  3660
CCCTCCCCTA GTAAAATTAT GGTACCAGTT AGAAAAAGAA CCCATAATAG GAGCAGAAAC  3720
TTTCTATGTA GATGGGGCAG CTAACAGAGA GACTAAATTA GGAAAAGCAG GATATGTTAC  3780
TGACAGAGGA AGACAAAAAG TTGTCTCCCT AACTGACACA ACAAATCAGA AGACTGAGTT  3840
ACAAGCAATT CATCTAGCTT TGCAGGATTC AGGATTAGAA GTAAACATAG TAACAGACTC  3900
ACAGTATGCA TTAGGAATCA TTCAAGCACA ACCAGATAAA AGTGAATCAG AAATAGTCAA  3960
TCAAATAATA GAGCAATTAA TAAAAAAGGA AAAGGTCTAC CTGGCATGGG TACCAGCACA  4020
CAAAGGAATT GGAGGGAATG AACAAGTAGA TAAATTAGTC AGTGCTGGAA TCAGGAAAAT  4080
ACTATTTTTA GATGGAATAG ATAAGGCACA AGAAGGCCAT GAGAAATATC ACAGTAATTG  4140
GAGAGCAATG GCTAGTGGTT TTAACCTGCC ACCTATAGTA GCAAAAGAAA TAGTAGCCAG  4200
CTGTGATAAA TGTCAGCTAA AAGGAGAAGC CATGCATGGA CAAGTAGACT GTAGTCCAGG  4260
AATATGGCAA CTAGATTGTA CACATCTAGA AGGAAAAATT ATCCTGGTAG CAGTTCATGT  4320
AGCCAGTGGA TATATAGAAG CAGAAGTTAT TCCAGCAGAG ACAGGGCAGG AAACAGCATA  4380
CTTTATCTTA AAATTAGCAG GAAGGTGGCC AGTAAACACA ATACATACAG ACAATGGCGG  4440
CAATTTCATC AGTACCACGG TTAAGGCCGC CTGTTGGTGG GCAGGGATCA AGCAGGAATT  4500
TGGCATTCCC TACAATCCCC AAAGCCAAGG AGTAGTGGAA TCTATGAATA GAGAATTAAA  4560
GAAAATTATA GGACAGGTAA GAGATCAGGC TGAACATCTT AAGACAGCAG TACAAATGGC  4620
AGTATTCATC CACAATTTTA AAAGAAAAGG GGGGATTGGG GGATACAGTG CAGGGGAAAG  4680
AATAGTAGAC ATAATAGCAA CAGACATACA AACTAAAGAA TTACAAAAGC AAATTACAAA  4740
AATTCAAAAT TTTCGGGTTT ATTACAGGGA CAGCAGAGAT CCACTTTGGA AAGGACCAGC  4800
AAAACTTCTC TGGAAAGGCG AAGGGGCAGT AGTAATACAA GATAATAGTG ACATAAAAGT  4860
AGTGCCAAGA AGAAAAGTAA AGATCATTAG GGATTATGGA AAACAGATGG CAGGTGATGA  4920
TTGTGTGGCA AGTAGACAGG ATGAGGATTA GAACATGGAA CAGTTTAGTG AAACAGCATA  4980
TGTATGTTTC AAAGAAAGCT AAGGGATGGA TTTATAGACA TCACTATGAA AACACTCATC  5040
CAAAAATAAG CTCAGAAGTA CACATCCCAC TAGGGGAAGC TAGATTGGTA ATAACAACAT  5100
ATTGGGGTCT ACATACAGGA GAAAGAGACT GGCATTTGGG TCAGGGAGTC TCCATAGAAT  5160
GGAGGGAAAG GACATATAGA ACACAAGTAG ACCCCGAACT AGCAGACCAA CTAATTCATA  5220
TACATTACTT TGATTGTTTT TCAGAATCTG CCATAAGAAG TGCCATATTA GGATATAGAG  5280
TTAGGCATAG GTGTGAATAT CAAGCAGGAC ATAACAAGGT AGGATCTCTA CAGTACTTGG  5340
CACTAACAGC ATTAATAACA CCAAAGAAGA TAAAGCCACC TTTGCCTAGT GTTGCGAAAC  5400
TGACAGAGGA TAGATGGAAC AAGCCCCAGA AGACCAAGGG [illegible] [illegible]  5460
TGAATGGACA CTAGAACTTT TAGAGGAGCT TAAGAATGAA GCTGTTAGGC ATTTTCCTAG  5520
GGTATGGCTC CATGGCTTAG GGCAACATAT CTATGAAACT TATGGGGATA CTTGGGAAGG  5580
AGTGGAGGCC ATAACAAGAA CTCTGCAACA ACTGCTGTTT ATTCATTTCA GAATTGGGTG  5640
TCAACATAGC AGAATAGGCA TTATTCGACA GAGGAGAGCA AGAAATGGAG CCAGTAGATC  5700
CTAGACTAGA GCCCTGGAAG CATCCAGGAA GTCAGCCTAA GACTGCGTGT ACCACTTGCT  5760
ATTGTAAAAA GTGCTGCTTT CATTGCCAAG TTTGTTTTAT GACAAAAGGC TTAGGCATCT  5820
CCTATGGCAG GAAGAAGCGG AGACAGCGAC GAAGAGCTCC TCAAGACAGT CAGACTCATC  5880
AAGCTTATCT ATCAAAGCAG TAAGTAATAT ATGTAATGCA ACCTTTACAA ATAGTAGCAA  5940
TAGTAGCATT AGTAGTAGCA GGAATAATAG CAATAGTTGT GTGGACCATA GTATTCATAG  6000
AATATAAGAA AATATTAAGA CAAAGAAAAA TAGACAGGTT GATTGATAGA ATAAGAGAAA  6060
GAGCAGAAGA CAGTGGCAAT GACAGTGAAG GGGATCAGGA AGAATTATCG GCACTTGTGG  6120
ACATGGGGCA CCATGATCCT TGGGATATTA ATGATCTGTA GAGCTGCAAA CAATTTGTGG  6180
GTCACAGTCT ATTATGGGGT ACCTGTGTGG AGAGAAGCAA CCACCACTCT ATTTTGTGCA  6240
TCAGATGCCA AGGCATATGA TGCAGAGGTA CATAATGTTT GGGCCACACA TGCCTGTGTA  6300
CCCACAGACC CTAACCCACA AGAAGTAGAA TTGAAAAATG TGACAGAAAA TTTTAACATG  6360
TGGAAAAATA ACATGGTAGA ACAGATGCAT GAGGATATAA TCAGTTTATG GGATCAAAGC  6420
CTGAAGCCAT GTGTAAAATT AACCCCACTC TGTGTTTCTT TAAATTGCAC TGATGCTACT  6480
AATACCACTA ATAGTAATAC CACTAGCAGC AGCGAGAAAC CGAAGGGGAC AGGGGAAATA  6540
AAAAACTGCT CTTTCAATAT CACCACAAGC ATAAGAGATA AGGTGCAGAA ACAATATGCA  6600
CTTTTTTATA GCCTTGATGT AGTACCAATG GATGATAATG ATAATAGTAC AAGCTATAGG  6660
TTAATAAGTT GTAACACCTC AATCATTACA CAGGCCTGTC CAAAGATATC CTTTGAGCCA  6720
ATTCCCATAC ATTATTGTGC CCCGGCTGGT TTTGCGATTC TAAAGTGTAA AGATAAAAGG  6780
TTCAATGGAA AAGGACCATG TACAAGTGTC AGCACAGTAC AGTGTACACA TGGAATTAGG  6840
CCAGTAGTAT CAACTCAACT GTTGTTAAAT GGCAGTCTAG CAGAAGAAGA GGTAGTAATT  6900
AGATCTGACA ATTTTACGAA CAATGCTAAA ACCATAATAG TACAGCTGAG CAAATCTGTA  6960
GAAATTACTT GTGTAAGACC CAACAACAAT ACAAGAAAAA GTATAAGTAT GGGACCAGGG  7020
AGAGCATTTT ATACAACAGA AATAATAGGA GATATAAGAC AAGCATATTG TAACATTAGT  7080
AAAGCAAACT GGACTGACAC TTTAGAACAG ATAGCTAGAA AATTAAGAGA ACAATTTGAG  7140
AATAAAACAA TAGTCTTTAA GCCATCCTCA GGAGGGGACC CAGAAATTGT AACACAGTTT  7200
TACAGTTTTA ATTGTGGAGG GGAATTTTTC TACTGTAATT CAACACAACT GTTTAATGGT  7260
ACTTGGAATG GTACTTGGGT TAATGGTACT TGGAGTAGTA ATAATACGAC TGATACTGCA  7320
AATATCACAC TCCCATGCAG AATAAAACAA TTTATAAACA TGTGGCAGGA AGTAGGAAAA  7380
GCAATGTATG CCCCTCCCAT CAAAGGACAA ATTAAATGTA CATCAAATAT TACAGGGCTG  7440
ATATTAACAA GAGATGGTGG TAACAATAAC ACCACGAACG ACAACGAGAC CGAGACCTTC  7500
AGACCTGGAG GAGGAGATAT GAGGGACAAT TGGAGAAGTG AATTATATAA ATATAAAGTA  7560
GTACAAGTTG AACCATTAGG AGTAGCACCC ACCAAGGCAA AGAGAAGAGT GGTGCAAAGA  7620
GAAAAAAGAG CAGTGGGAAT AGGAGCTATG TTCCTTGGGT TCTTAGGAGC AGCAGGAAGC  7680
ACTATGGGCG CAGCGTCAGT GACGCTGACG GTACAAGCCA GACAATTATT GTCTGGTATA  7740
GTGCAGCAGC AGAACAATCT GCTGAGGGCT ATTGAGGCGC AACAGCATCT GTTGCAACTC  7800
ACAGTCTGGG GCATCAAACA GCTCCAGGCA AGAGTCCTGG CTGTGGAAAG ATACCTAAGG  7860
GATCAACAGC TCCTGGGACT TTGGGGTTGC TCTGGAAAAC TCATTTGCAC CACTACTGTG  7920
CCTTGGAACA ATAGCTGGAG TAATAAATCT CTGGAAACAA TTTGGGATAA CATGACCTGG  7980
ATGCAGTGGG AAAGAGAAAT TGACAATTAC ACAAACATAA TATACACCTT AATTGAAGAA  8040
TCGCAGAACC AACAAGAAAA AAATGAACTA GAATTATTGG AATTGGATAA ATGGGCAAAT  8100
TTGTGGAATT GGTTTAGTAT ATCAAACTGG CTATGGTATA TAAAATTATT CATAATGGTA  8160
GTAGGAGGCT TGGTAGGTTT AAGAATAGTT TTTACTGTAC TTTCTATAGT TAATAGAGTT  8220
AGGCAGGGAT ACTCACCATT ATCGTTTCAG ACCCACCTCC CAACCCCGAA GGGACCCGAC  8280
AGGCCAGAAG GAATCGAAGA AGAAGGTGGA GAGAGAGACA GAGGCAGCTC CACTCGATTA  8340
GTGCACGGAT TCTTAGCACT TTTCTGGGAC GACCTGAGGA GTCTGTGCCT CTTCAGCTAC  8400
CACCACTTGA GAGACTTACT CTTGATTGTA ACGAGGATTG TGGAACTTCT GGGACGCAGG  8460
GGATGGGAAG CCCTCAAATA CTGGTGGAAT CTCCTGCAGT ATTGGAGGCA GGAACTACAG  8520
AAGAGTGCTG TTAGCTTGTT CAATGGCACG GCCATAGCAG TAGCTGAGGG GACAGATAGA  8580
GTTATAGAAG CTTTACGAAG GGCTTATAGA GCTATTCTCC ACATACCTAG AAGAATAAGA  8640
CAGGGCTTAG AAAGGGCTTT GCTATAAAAT GGGTGGCAAG TGGTCAGAAA GTAGTGTGGT  8700
TAGAAGGCAT GTACCTTTAA GACAAGGCAG CTATAGATCT TAGCCGCTTT TTAAAAGAAA  8760
AGGGGGGACT GGAAGGGCTA ATTCACTCAC GGAAAAGACC AGTTGAACCA GAAGAAGATA  8820
GAAGAGGCCA TGAAGAAGAA AACAACAGAT TGTTCTGCTT GCTCAGCTGG GGACTTTCCA  8880
GAAGGCGCGG CCTGAGTGAC TAAGCCCCGT TGGGGACTTT CCGAAGAGGC ATGAAGGGAC  8940
TTTCCAAGGC AGGCGTGGCC TGGGCGGGAC TGGGGAGTGG CGAGCCCTCA GATGCTGCAT  9000
ATAAGCAGCT GCTTTCTGCC TGTACTGGGT CTCTCTGGTT AGACCAGATC TGAGCCTGGG  9060
AGCTCTCTGG CTAGCTAGGG AACCCACTGC TTAAGCCTCA ATAAAGCTTG CCTTGAGTGC  9120
TTCAAGTAGT GTGTGCCCGT CTGTTGTGTG ACTCTGGTAT CTAGAGATCC CTCAGACCAT  9180
TTTAGTCCGT GTGGAAAATC TCTAGCA  9207



(2) INFORMATION FOR SEQ ID NO: 801: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:801: 



Asn Thr Ser Leu Leu His Pro Val Ser Leu His Gly Met Asp Asp Pro Glu
1               5                   10                  15



(2) INFORMATION FOR SEQ ID NO: 802: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 802: 



ACCAGCTTGT TACACCCTGT GAGCCTGCAT GGAATGGATG ACCCTGAG 



(2) INFORMATION FOR SEQ ID NO: 803: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:803: 
ACCAGCTTGT 



(2) INFORMATION FOR SEQ ID NO: 804: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 804: 
CCAGCTTGTT



(2) INFORMATION FOR SEQ ID NO: 805: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 805 
CAGCTTGTTA 



(2) INFORMATION FOR SEQ ID NO: 806: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 806 
AGCTTGTTAC 



(2) INFORMATION FOR SEQ ID NO: 807: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 807 
GCTTGTTACA 



(2) INFORMATION FOR SEQ ID NO: 808: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 808 



CTTGTTACAC 



(2) INFORMATION FOR SEQ ID NO: 809: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 809 
TTGTTACACC 



(2) INFORMATION FOR SEQ ID NO: 810: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 810 
TGTTACACCC 



(2) INFORMATION FOR SEQ ID NO: 811: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 811 
GTTACACCCT



(2) INFORMATION FOR SEQ ID NO: 812: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 812 
TTACACCCTG 



(2) INFORMATION FOR SEQ ID NO: 813: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 813 
TACACCCTGT 



(2) INFORMATION FOR SEQ ID NO: 814: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 814 
ACACCCTGTG 



(2) INFORMATION FOR SEQ ID NO: 815: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 815 
CACCCTGTGA 



(2) INFORMATION FOR SEQ ID NO: 816: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 816 
ACCCTGTGAG 



(2) INFORMATION FOR SEQ ID NO: 817: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 817
CCCTGTGAGC 



(2) INFORMATION FOR SEQ ID NO:818: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 818 
CCTGTGAGCC 



(2) INFORMATION FOR SEQ ID NO: 819: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 819 
CTGTGAGCCT



(2) INFORMATION FOR SEQ ID NO: 820: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 820
TGTGAGCCTG 



(2) INFORMATION FOR SEQ ID NO: 821: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:821 
GTGAGCCTGC 



(2) INFORMATION FOR SEQ ID NO: 822: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 822 
TGAGCCTGCA 



(2) INFORMATION FOR SEQ ID NO: 823:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 823 
GAGCCTGCAT 



(2) INFORMATION FOR SEQ ID NO: 824: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 824 
AGCCTGCATG 



(2) INFORMATION FOR SEQ ID NO: 825:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 825 
GCCTGCATGG 



(2) INFORMATION FOR SEQ ID NO: 826: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 826 
CCTGCATGGA



(2) INFORMATION FOR SEQ ID NO: 827: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 827 
CTGCATGGAA 



(2) INFORMATION FOR SEQ ID NO: 828:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 828 
TGCATGGAAT 



(2) INFORMATION FOR SEQ ID NO: 829: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 829
GCATGGAATG 



(2) INFORMATION FOR SEQ ID NO: 830: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 830 
CATGGAATGG 



(2) INFORMATION FOR SEQ ID NO: 831: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 831 
ATGGAATGGA 



(2) INFORMATION FOR SEQ ID NO: 832: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 832 
TGGAATGGAT 



(2) INFORMATION FOR SEQ ID NO: 833: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 833 
GGAATGGATG 



(2) INFORMATION FOR SEQ ID NO: 834: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 834 
GAATGGATGA 



(2) INFORMATION FOR SEQ ID NO: 835: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 835 
AATGGATGAC 



(2) INFORMATION FOR SEQ ID NO: 836: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 836 
ATGGATGACC 



(2) INFORMATION FOR SEQ ID NO: 837: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 837
TGGATGACCC 



(2) INFORMATION FOR SEQ ID NO: 838: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 838 
GGATGACCCT 



(2) INFORMATION FOR SEQ ID NO: 839: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 839 
GATGACCCTG 



(2) INFORMATION FOR SEQ ID NO: 840: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 840 
ATGACCCTGA 



(2) INFORMATION FOR SEQ ID NO: 841: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 841 



TGACCCTGAG 

CLAIMS:

1. An isolated HIV-1 strain or a component, part, fragment or derivative thereof
which is substantially non-pathogenic.

2. A strain of HIV-1 according to claim 1 wherein said strain is capable of
stimulating an immune response in a human or primate against at least one glycoprotein
of HIV-1 whilst not substantially reducing proliferative responses and cytokine
production to a mitogen in said human or primate compared to a healthy, non-infected
human or primate subject.

3. A strain of HIV-1 according to claim 2 which carries one or more mutations in
its genome resulting in the inability to direct synthesis of at least one polypeptide or
protein from a pathogenic strain of HIV-1 or results in reduced synthesis of said
polypeptide or protein.

4. A strain of HIV-1 according to claim 2 which carries one or more mutations in
its genome resulting in the said genome directing synthesis of a truncated form of a
polypeptide or protein from a pathogenic strain of HIV-1.

5. A strain of HIV-1 according to claim 3 or 4 wherein said strain carries a
mutation in its nef gene and/or long terminal repeat (LTR) region or in a functionally
equivalent location in the HIV-1 genome.

6. A strain of HIV-1 according to claim 5 wherein said strain is reactive to
antibodies to gp41-45, gp120 and/or gp160 of HIV-1 and carries a deletion of at least
ten contiguous nucleotides in a region corresponding to the nef gene and/or LTR region
of pathogenic HIV-1.

7. A strain of HIV-1 according to claim 6 wherein said strain encodes a modified
Nef protein without amino acids 162 to 177 of wild-type HIV-1 NL4-3 Nef.

8. A strain of HIV-1 according to claim 5 or 7 wherein said strain is capable of
inducing an immune response to at least one of gag, pol and/or env.

9. A strain of HIV-1 according to claim 5 wherein said strain carries a deletion in
its genome of at least 10 nucleotides within the region from nucleotide 8787 to
nucleotide 9709 using the nucleotide numbering of HIV-1 NL4-3.

10. A strain of HIV-1 according to claim 9 wherein said strain carries a deletion in
its genome of at least 10 nucleotides from within a region selected from the list
consisting of:

nucleotide (i) 8830-8862;

(ii) 9009-9035;
(iii) 9019-9029; and

(iv) 9033-9049.

11. A strain of HIV-1 according to claim 9 wherein said strain carries a deletion in
its genome of at least 10 nucleotides from within a region selected from the list
consisting of:

nucleotide (v) 9281-9371;

(vi) 9281-9362;
(vii) 9105-9224; and

(viii) 9271-9370.



12. A strain of HIV-1 according to claim 9 wherein said strain carries a deletion in
its genome of at least 10 nucleotides from within a region selected from the list
consisting of: 

nucleotide (ix) 8882-892g; 

(x) 8850-9006; 
(xi) 8792-9041; and

(xii) 9112-9204. 

13. A strain of HIV-1 according to claim 9 wherein said strain carries a deletion in
its genome of at least 10 nucleotides from within a region selected from the list
consisting of:

nucleotide (xiii) 9105-9224;

(xiv) 9389-9395; and

(xv) 9281-9366.



14. A strain of HIV-1 according to claim 9 wherein said deletion encompasses at
least one of the following decanucleotides from the nef gene of HIV-1 NL4-3:





ATGGGTGGCA (SEQ ID NO: 2); TGGGTGGCAA (SEQ ID NO: 3);
GGGTGGCAAG (SEQ ID NO: 4); GGTGGCAAGT (SEQ ID NO: 5);
GTGGCAAGTG (SEQ ID NO: 6); TGGCAAGTGG (SEQ ID NO: 7);
GGCAAGTGGT (SEQ ID NO: 8); GCAAGTGGTC (SEQ ID NO: 9);
CAAGTGGTCA (SEQ ID NO: 10); AAGTGGTCAA (SEQ ID NO: 11);
AGTGGTCAAA (SEQ ID NO: 12); GTGGTCAAAA (SEQ ID NO: 13);
TGGTCAAAAA (SEQ ID NO: 14); GGTCAAAAAG (SEQ ID NO: 15);
GTCAAAAAGT (SEQ ID NO: 16); TCAAAAAGTA (SEQ ID NO: 17);
CAAAAAGTAG (SEQ ID NO: 18); AAAAAGTAGT (SEQ ID NO: 19);
AAAAGTAGTG (SEQ ID NO: 20); AAAGTAGTGT (SEQ ID NO: 21);
AAGTAGTGTG (SEQ ID NO: 22); AGTAGTGTGA (SEQ ID NO: 23);
GTAGTGTGAT (SEQ ID NO: 24); TAGTGTGATT (SEQ ID NO: 25);
AGTGTGATTG (SEQ ID NO: 26); GTGTGATTGG (SEQ ID NO: 27);
TGTGATTGGA (SEQ ID NO: 28); GTGATTGGAT (SEQ ID NO: 29);
TGATTGGATG (SEQ ID NO: 30); GATTGGATGG (SEQ ID NO: 31);
ATTGGATGGC (SEQ ID NO: 32); TTGGATGGCC (SEQ ID NO: 33);
TGGATGGCCT (SEQ ID NO: 34); GGATGGCCTG (SEQ ID NO: 35);
GATGGCCTGC (SEQ ID NO: 36); ATGGCCTGCT (SEQ ID NO: 37);
TGGCCTGCTG (SEQ ID NO: 38); GGCCTGCTGT (SEQ ID NO: 39);
GCCTGCTGTA (SEQ ID NO: 40); CCTGCTGTAA (SEQ ID NO: 41);
CTGCTGTAAG (SEQ ID NO: 42); TGCTGTAAGG (SEQ ID NO: 43);
GCTGTAAGGG (SEQ ID NO: 44); CTGTAAGGGA (SEQ ID NO: 45);
TGTAAGGGAA (SEQ ID NO: 46); GTAAGGGAAA (SEQ ID NO: 47);
TAAGGGAAAG (SEQ ID NO: 48); AAGGGAAAGA (SEQ ID NO: 49);
AGGGAAAGAA (SEQ ID NO: 50); GGGAAAGAAT (SEQ ID NO: 51);
GGAAAGAATG (SEQ ID NO: 52); GAAAGAATGA (SEQ ID NO: 53);
AAAGAATGAG (SEQ ID NO: 54); AAGAATGAGA (SEQ ID NO: 55);
AGAATGAGAC (SEQ ID NO: 56); GAATGAGACG (SEQ ID NO: 57);
AATGAGACGA (SEQ ID NO: 58); ATGAGACGAG (SEQ ID NO: 59);
TGAGACGAGC (SEQ ID NO: 60); GAGACGAGCT (SEQ ID NO: 61);
AGACGAGCTG (SEQ ID NO: 62); GACGAGCTGA (SEQ ID NO: 63);
ACGAGCTGAG (SEQ ID NO: 64); CGAGCTGAGC (SEQ ID NO: 65);
GAGCTGAGCC (SEQ ID NO: 66); AGCTGAGCCA (SEQ ID NO: 67);
GCTGAGCCAG (SEQ ID NO: 68); CTGAGCCAGC (SEQ ID NO: 69);
TGAGCCAGCA (SEQ ID NO: 70); GAGCCAGCAG (SEQ ID NO: 71);
AGCCAGCAGC (SEQ ID NO: 72); GCCAGCAGCA (SEQ ID NO: 73);
CCAGCAGCAG (SEQ ID NO: 74); CAGCAGCAGA (SEQ ID NO: 75);
AGCAGCAGAT (SEQ ID NO: 76); GCAGCAGATG (SEQ ID NO: 77);
CAGCAGATGG (SEQ ID NO: 78); AGCAGATGGG (SEQ ID NO: 79);
GCAGATGGGG (SEQ ID NO: 80); CAGATGGGGT (SEQ ID NO: 81);
AGATGGGGTG (SEQ ID NO: 82); GATGGGGTGG (SEQ ID NO: 83);
ATGGGGTGGG (SEQ ID NO: 84); TGGGGTGGGA (SEQ ID NO: 85);
GGGGTGGGAG (SEQ ID NO: 86); GGGTGGGAGC (SEQ ID NO: 87);
GGTGGGAGCA (SEQ ID NO: 88); GTGGGAGCAG (SEQ ID NO: 89);
TGGGAGCAGT (SEQ ID NO: 90); GGGAGCAGTA (SEQ ID NO: 91);
GGAGCAGTAT (SEQ ID NO: 92); GAGCAGTATC (SEQ ID NO: 93);
AGCAGTATCT (SEQ ID NO: 94); GCAGTATCTC (SEQ ID NO: 95);
CAGTATCTCG (SEQ ID NO: 96); AGTATCTCGA (SEQ ID NO: 97);
GTATCTCGAG (SEQ ID NO: 98); TATCTCGAGA (SEQ ID NO: 99);
ATCTCGAGAC (SEQ ID NO: 100); TCTCGAGACC (SEQ ID NO: 101);
CTCGAGACCT (SEQ ID NO: 102); TCGAGACCTA (SEQ ID NO: 103);
CGAGACCTAG (SEQ ID NO: 104); GAGACCTAGA (SEQ ID NO: 105);
AGACCTAGAA (SEQ ID NO: 106); GACCTAGAAA (SEQ ID NO: 107);
ACCTAGAAAA (SEQ ID NO: 108); CCTAGAAAAA (SEQ ID NO: 109);
CTAGAAAAAC (SEQ ID NO: 110); TAGAAAAACA (SEQ ID NO: 111);
AGAAAAACAT (SEQ ID NO: 112); GAAAAACATG (SEQ ID NO: 113);
AAAAACATGG (SEQ ID NO: 114); AAAACATGGA (SEQ ID NO: 115);
AAACATGGAG (SEQ ID NO: 116); AACATGGAGC (SEQ ID NO: 117);
ACATGGAGCA (SEQ ID NO: 118); CATGGAGCAA (SEQ ID NO: 119);
ATGGAGCAAT (SEQ ID NO: 120); TGGAGCAATC (SEQ ID NO: 121);
GGAGCAATCA (SEQ ID NO: 122); GAGCAATCAC (SEQ ID NO: 123);
AGCAATCACA (SEQ ID NO: 124); GCAATCACAA (SEQ ID NO: 125);
CAATCACAAG (SEQ ID NO: 126); AATCACAAGT (SEQ ID NO: 127);
ATCACAAGTA (SEQ ID NO: 128); TCACAAGTAG (SEQ ID NO: 129);
CACAAGTAGC (SEQ ID NO: 130); ACAAGTAGCA (SEQ ID NO: 131);
CAAGTAGCAA (SEQ ID NO: 132); AAGTAGCAAT (SEQ ID NO: 133);
AGTAGCAATA (SEQ ID NO: 134); GTAGCAATAC (SEQ ID NO: 135);
TAGCAATACA (SEQ ID NO: 136); AGCAATACAG (SEQ ID NO: 137);
GCAATACAGC (SEQ ID NO: 138); CAATACAGCA (SEQ ID NO: 139);
AATACAGCAG (SEQ ID NO: 140); ATACAGCAGC (SEQ ID NO: 141);
TACAGCAGCT (SEQ ID NO: 142); ACAGCAGCTA (SEQ ID NO: 143);
CAGCAGCTAA (SEQ ID NO: 144); AGCAGCTAAC (SEQ ID NO: 145);
GCAGCTAACA (SEQ ID NO: 146); CAGCTAACAA (SEQ ID NO: 147);
AGCTAACAAT (SEQ ID NO: 148); GCTAACAATG (SEQ ID NO: 149);
CTAACAATGC (SEQ ID NO: 150); TAACAATGCT (SEQ ID NO: 151);
AACAATGCTG (SEQ ID NO: 152); ACAATGCTGC (SEQ ID NO: 153);
CAATGCTGCT (SEQ ID NO: 154); AATGCTGCTT (SEQ ID NO: 155);
ATGCTGCTTG (SEQ ID NO: 156); TGCTGCTTGT (SEQ ID NO: 157);
GCTGCTTGTG (SEQ ID NO: 158); CTGCTTGTGC (SEQ ID NO: 159);
TGCTTGTGCC (SEQ ID NO: 160); GCTTGTGCCT (SEQ ID NO: 161);
CTTGTGCCTG (SEQ ID NO: 162); TTGTGCCTGG (SEQ ID NO: 163);
TGTGCCTGGC (SEQ ID NO: 164); GTGCCTGGCT (SEQ ID NO: 165);
TGCCTGGCTA (SEQ ID NO: 166); GCCTGGCTAG (SEQ ID NO: 167);
CCTGGCTAGA (SEQ ID NO: 168); CTGGCTAGAA (SEQ ID NO: 169);
TGGCTAGAAG (SEQ ID NO: 170); GGCTAGAAGC (SEQ ID NO: 171);
GCTAGAAGCA (SEQ ID NO: 172); CTAGAAGCAC (SEQ ID NO: 173);
TAGAAGCACA (SEQ ID NO: 174); AGAAGCACAA (SEQ ID NO: 175);
GAAGCACAAG (SEQ ID NO: 176); AAGCACAAGA (SEQ ID NO: 177);
AGCACAAGAG (SEQ ID NO: 178); GCACAAGAGG (SEQ ID NO: 179);
CACAAGAGGA (SEQ ID NO: 180); ACAAGAGGAG (SEQ ID NO: 181);
CAAGAGGAGG (SEQ ID NO: 182); AAGAGGAGGA (SEQ ID NO: 183);
AGAGGAGGAA (SEQ ID NO: 184); GAGGAGGAAG (SEQ ID NO: 185);
AGGAGGAAGA (SEQ ID NO: 186); GGAGGAAGAG (SEQ ID NO: 187);
GAGGAAGAGG (SEQ ID NO: 188); AGGAAGAGGT (SEQ ID NO: 189);
GGAAGAGGTG (SEQ ID NO: 190); GAAGAGGTGG (SEQ ID NO: 191);
AAGAGGTGGG (SEQ ID NO: 192); AGAGGTGGGT (SEQ ID NO: 193);
GAGGTGGGTT (SEQ ID NO: 194); AGGTGGGTTT (SEQ ID NO: 195);
GGTGGGTTTT (SEQ ID NO: 196); GTGGGTTTTC (SEQ ID NO: 197);
TGGGTTTTCC (SEQ ID NO: 198); GGGTTTTCCA (SEQ ID NO: 199);
GGTTTTCCAG (SEQ ID NO: 200); GTTTTCCAGT (SEQ ID NO: 201);
TTTTCCAGTC (SEQ ID NO: 202); TTTCCAGTCA (SEQ ID NO: 203);
TTCCAGTCAC (SEQ ID NO: 204); TCCAGTCACA (SEQ ID NO: 205);
CCAGTCACAC (SEQ ID NO: 206); CAGTCACACC (SEQ ID NO: 207);
AGTCACACCT (SEQ ID NO: 208); GTCACACCTC (SEQ ID NO: 209);
TCACACCTCA (SEQ ID NO: 210); CACACCTCAG (SEQ ID NO: 211);
ACACCTCAGG (SEQ ID NO: 212); CACCTCAGGT (SEQ ID NO: 213);
ACCTCAGGTA (SEQ ID NO: 214); CCTCAGGTAC (SEQ ID NO: 215);
CTCAGGTACC (SEQ ID NO: 216); TCAGGTACCT (SEQ ID NO: 217);
CAGGTACCTT (SEQ ID NO: 218); AGGTACCTTT (SEQ ID NO: 219);
GGTACCTTTA (SEQ ID NO: 220); GTACCTTTAA (SEQ ID NO: 221);
TACCTTTAAG (SEQ ID NO: 222); ACCTTTAAGA (SEQ ID NO: 223);
CCTTTAAGAC (SEQ ID NO: 224); CTTTAAGACC (SEQ ID NO: 225);
TTTAAGACCA (SEQ ID NO: 226); TTAAGACCAA (SEQ ID NO: 227);
TAAGACCAAT (SEQ ID NO: 228); AAGACCAATG (SEQ ID NO: 229);
AGACCAATGA (SEQ ID NO: 230); GACCAATGAC (SEQ ID NO: 231);
ACCAATGACT (SEQ ID NO: 232); CCAATGACTT (SEQ ID NO: 233);
CAATGACTTA (SEQ ID NO: 234); AATGACTTAC (SEQ ID NO: 235);
ATGACTTACA (SEQ ID NO: 236); TGACTTACAA (SEQ ID NO: 237);
GACTTACAAG (SEQ ID NO: 238); ACTTACAAGG (SEQ ID NO: 239);
CTTACAAGGC (SEQ ID NO: 240); TTACAAGGCA (SEQ ID NO: 241);
TACAAGGCAG (SEQ ID NO: 242); ACAAGGCAGC (SEQ ID NO: 243);
CAAGGCAGCT (SEQ ID NO: 244); AAGGCAGCTG (SEQ ID NO: 245);
AGGCAGCTGT (SEQ ID NO: 246); GGCAGCTGTA (SEQ ID NO: 247);
GCAGCTGTAG (SEQ ID NO: 248); CAGCTGTAGA (SEQ ID NO: 249);
AGCTGTAGAT (SEQ ID NO: 250); GCTGTAGATC (SEQ ID NO: 251);
CTGTAGATCT (SEQ ID NO: 252); TGTAGATCTT (SEQ ID NO: 253);
GTAGATCTTA (SEQ ID NO: 254); TAGATCTTAG (SEQ ID NO: 255);
AGATCTTAGC (SEQ ID NO: 256); GATCTTAGCC (SEQ ID NO: 257);
ATCTTAGCCA (SEQ ID NO: 258); TCTTAGCCAC (SEQ ID NO: 259);
CTTAGCCACT (SEQ ID NO: 260); TTAGCCACTT (SEQ ID NO: 261);
TAGCCACTTT (SEQ ID NO: 262); AGCCACTTTT (SEQ ID NO: 263);
GCCACTTTTT (SEQ ID NO: 264); CCACTTTTTA (SEQ ID NO: 265);
CACTTTTTAA (SEQ ID NO: 266); ACTTTTTAAA (SEQ ID NO: 267);
CTTTTTAAAA (SEQ ID NO: 268); TTTTTAAAAG (SEQ ID NO: 269);
TTTTAAAAGA (SEQ ID NO: 270); TTTAAAAGAA (SEQ ID NO: 271);
TTAAAAGAAA (SEQ ID NO: 272); TAAAAGAAAA (SEQ ID NO: 273);
AAAAGAAAAG (SEQ ID NO: 274); AAAGAAAAGG (SEQ ID NO: 275);
AAGAAAAGGG (SEQ ID NO: 276); AGAAAAGGGG (SEQ ID NO: 277);
GAAAAGGGGG (SEQ ID NO: 278); AAAAGGGGGG (SEQ ID NO: 279);
AAAGGGGGGA (SEQ ID NO: 280); AAGGGGGGAC (SEQ ID NO: 281);
AGGGGGGACT (SEQ ID NO: 282); GGGGGGACTG (SEQ ID NO: 283);
GGGGGACTGG (SEQ ID NO: 284); GGGGACTGGA (SEQ ID NO: 285);
GGGACTGGAA (SEQ ID NO: 286); GGACTGGAAG (SEQ ID NO: 287);
GACTGGAAGG (SEQ ID NO: 288); ACTGGAAGGG (SEQ ID NO: 289);
CTGGAAGGGC (SEQ ID NO: 290); TGGAAGGGCT (SEQ ID NO: 291);
GGAAGGGCTA (SEQ ID NO: 292); GAAGGGCTAA (SEQ ID NO: 293);
AAGGGCTAAT (SEQ ID NO: 294); AGGGCTAATT (SEQ ID NO: 295);
GGGCTAATTC (SEQ ID NO: 296); GGCTAATTCA (SEQ ID NO: 297);
GCTAATTCAC (SEQ ID NO: 298); CTAATTCACT (SEQ ID NO: 299);
TAATTCACTC (SEQ ID NO: 300); AATTCACTCC (SEQ ID NO: 301);
ATTCACTCCC (SEQ ID NO: 302); TTCACTCCCA (SEQ ID NO: 303);
TCACTCCCAA (SEQ ID NO: 304); CACTCCCAAA (SEQ ID NO: 305);
ACTCCCAAAG (SEQ ID NO: 306); CTCCCAAAGA (SEQ ID NO: 307);
TCCCAAAGAA (SEQ ID NO: 308); CCCAAAGAAG (SEQ ID NO: 309);
CCAAAGAAGA (SEQ ID NO: 310); CAAAGAAGAC (SEQ ID NO: 311);
AAAGAAGACA (SEQ ID NO: 312); AAGAAGACAA (SEQ ID NO: 313);
AGAAGACAAG (SEQ ID NO: 314); GAAGACAAGA (SEQ ID NO: 315);
AAGACAAGAT (SEQ ID NO: 316); AGACAAGATA (SEQ ID NO: 317);
GACAAGATAT (SEQ ID NO: 318); ACAAGATATC (SEQ ID NO: 319);
CAAGATATCC (SEQ ID NO: 320); AAGATATCCT (SEQ ID NO: 321);
AGATATCCTT (SEQ ID NO: 322); GATATCCTTG (SEQ ID NO: 323);
ATATCCTTGA (SEQ ID NO: 324); TATCCTTGAT (SEQ ID NO: 325);
ATCCTTGATC (SEQ ID NO: 326); TCCTTGATCT (SEQ ID NO: 327);
CCTTGATCTG (SEQ ID NO: 328); CTTGATCTGT (SEQ ID NO: 329);
TTGATCTGTG (SEQ ID NO: 330); TGATCTGTGG (SEQ ID NO: 331);
GATCTGTGGA (SEQ ID NO: 332); ATCTGTGGAT (SEQ ID NO: 333);
TCTGTGGATC (SEQ ID NO: 334); CTGTGGATCT (SEQ ID NO: 335);
TGTGGATCTA (SEQ ID NO: 336); GTGGATCTAC (SEQ ID NO: 337);
TGGATCTACC (SEQ ID NO: 338); GGATCTACCA (SEQ ID NO: 339);
GATCTACCAC (SEQ ID NO: 340); ATCTACCACA (SEQ ID NO: 341);
TCTACCACAC (SEQ ID NO: 342); CTACCACACA (SEQ ID NO: 343);
TACCACACAC (SEQ ID NO: 344); ACCACACACA (SEQ ID NO: 345);
CCACACACAA (SEQ ID NO: 346); CACACACAAG (SEQ ID NO: 347);
ACACACAAGG (SEQ ID NO: 348); CACACAAGGC (SEQ ID NO: 349);
ACACAAGGCT (SEQ ID NO: 350); CACAAGGCTA (SEQ ID NO: 351);
ACAAGGCTAC (SEQ ID NO: 352); CAAGGCTACT (SEQ ID NO: 353);
AAGGCTACTT (SEQ ID NO: 354); AGGCTACTTC (SEQ ID NO: 355);
GGCTACTTCC (SEQ ID NO: 356); GCTACTTCCC (SEQ ID NO: 357);
CTACTTCCCT (SEQ ID NO: 358); TACTTCCCTG (SEQ ID NO: 359);
ACTTCCCTGA (SEQ ID NO: 360); CTTCCCTGAT (SEQ ID NO: 361);
TTCCCTGATT (SEQ ID NO: 362); TCCCTGATTG (SEQ ID NO: 363);
CCCTGATTGG (SEQ ID NO: 364); CCTGATTGGC (SEQ ID NO: 365);
CTGATTGGCA (SEQ ID NO: 366); TGATTGGCAG (SEQ ID NO: 367);
GATTGGCAGA (SEQ ID NO: 368); ATTGGCAGAA (SEQ ID NO: 369);
TTGGCAGAAC (SEQ ID NO: 370); TGGCAGAACT (SEQ ID NO: 371);
GGCAGAACTA (SEQ ID NO: 372); GCAGAACTAC (SEQ ID NO: 373);
CAGAACTACA (SEQ ID NO: 374); AGAACTACAC (SEQ ID NO: 375);
GAACTACACA (SEQ ID NO: 376); AACTACACAC (SEQ ID NO: 377);
ACTACACACC (SEQ ID NO: 378); CTACACACCA (SEQ ID NO: 379);
TACACACCAG (SEQ ID NO: 380); ACACACCAGG (SEQ ID NO: 381);
CACACCAGGG (SEQ ID NO: 382); ACACCAGGGC (SEQ ID NO: 383);
CACCAGGGCC (SEQ ID NO: 384); ACCAGGGCCA (SEQ ID NO: 385);
CCAGGGCCAG (SEQ ID NO: 386); CAGGGCCAGG (SEQ ID NO: 387);
AGGGCCAGGG (SEQ ID NO: 388); GGGCCAGGGG (SEQ ID NO: 389);
GGCCAGGGGT (SEQ ID NO: 390); GCCAGGGGTC (SEQ ID NO: 391);
CCAGGGGTCA (SEQ ID NO: 392); CAGGGGTCAG (SEQ ID NO: 393);
AGGGGTCAGA (SEQ ID NO: 394); GGGGTCAGAT (SEQ ID NO: 395);
GGGTCAGATA (SEQ ID NO: 396); GGTCAGATAT (SEQ ID NO: 397);
GTCAGATATC (SEQ ID NO: 398); TCAGATATCC (SEQ ID NO: 399);
CAGATATCCA (SEQ ID NO: 400); AGATATCCAC (SEQ ID NO: 401);
GATATCCACT (SEQ ID NO: 402); ATATCCACTG (SEQ ID NO: 403);
TATCCACTGA (SEQ ID NO: 404); ATCCACTGAC (SEQ ID NO: 405);
TCCACTGACC (SEQ ID NO: 406); CCACTGACCT (SEQ ID NO: 407);
CACTGACCTT (SEQ ID NO: 408); ACTGACCTTT (SEQ ID NO: 409);
CTGACCTTTG (SEQ ID NO: 410); TGACCTTTGG (SEQ ID NO: 411);
GACCTTTGGA (SEQ ID NO: 412); ACCTTTGGAT (SEQ ID NO: 413);
CCTTTGGATG (SEQ ID NO: 414); CTTTGGATGG (SEQ ID NO: 415);
TTTGGATGGT (SEQ ID NO: 416); TTGGATGGTG (SEQ ID NO: 417);
TGGATGGTGC (SEQ ID NO: 418); GGATGGTGCT (SEQ ID NO: 419);
GATGGTGCTA (SEQ ID NO: 420); ATGGTGCTAC (SEQ ID NO: 421);
TGGTGCTACA (SEQ ID NO: 422); GGTGCTACAA (SEQ ID NO: 423);
GTGCTACAAG (SEQ ID NO: 424); TGCTACAAGC (SEQ ID NO: 425);
GCTACAAGCT (SEQ ID NO: 426); CTACAAGCTA (SEQ ID NO: 427);
TACAAGCTAG (SEQ ID NO: 428); ACAAGCTAGT (SEQ ID NO: 429);
CAAGCTAGTA (SEQ ID NO: 430); AAGCTAGTAC (SEQ ID NO: 431);
AGCTAGTACC (SEQ ID NO: 432); GCTAGTACCA (SEQ ID NO: 433);
CTAGTACCAG (SEQ ID NO: 434); TAGTACCAGT (SEQ ID NO: 435);
AGTACCAGTT (SEQ ID NO: 436); GTACCAGTTG (SEQ ID NO: 437);
TACCAGTTGA (SEQ ID NO: 438); ACCAGTTGAG (SEQ ID NO: 439);
CCAGTTGAGC (SEQ ID NO: 440); CAGTTGAGCC (SEQ ID NO: 441);
AGTTGAGCCA (SEQ ID NO: 442); GTTGAGCCAG (SEQ ID NO: 443);
TTGAGCCAGA (SEQ ID NO: 444); TGAGCCAGAT (SEQ ID NO: 445);
GAGCCAGATA (SEQ ID NO: 446); AGCCAGATAA (SEQ ID NO: 447);
GCCAGATAAG (SEQ ID NO: 448); CCAGATAAGG (SEQ ID NO: 449);
CAGATAAGGT (SEQ ID NO: 450); AGATAAGGTA (SEQ ID NO: 451);
GATAAGGTAG (SEQ ID NO: 452); ATAAGGTAGA (SEQ ID NO: 453);
TAAGGTAGAA (SEQ ID NO: 454); AAGGTAGAAG (SEQ ID NO: 455);
AGGTAGAAGA (SEQ ID NO: 456); GGTAGAAGAG (SEQ ID NO: 457);
GTAGAAGAGG (SEQ ID NO: 458); TAGAAGAGGC (SEQ ID NO: 459);
AGAAGAGGCC (SEQ ID NO: 460); GAAGAGGCCA (SEQ ID NO: 461);
AAGAGGCCAA (SEQ ID NO: 462); AGAGGCCAAT (SEQ ID NO: 463);
GAGGCCAATA (SEQ ID NO: 464); AGGCCAATAA (SEQ ID NO: 465);
GGCCAATAAA (SEQ ID NO: 466); GCCAATAAAG (SEQ ID NO: 467);
CCAATAAAGG (SEQ ID NO: 468); CAATAAAGGA (SEQ ID NO: 469);
AATAAAGGAG (SEQ ID NO: 470); ATAAAGGAGA (SEQ ID NO: 471);
TAAAGGAGAG (SEQ ID NO: 472); AAAGGAGAGA (SEQ ID NO: 473);
AAGGAGAGAA (SEQ ID NO: 474); AGGAGAGAAC (SEQ ID NO: 475);
GGAGAGAACA (SEQ ID NO: 476); GAGAGAACAC (SEQ ID NO: 477);
AGAGAACACC (SEQ ID NO: 478); GAGAACACCA (SEQ ID NO: 479);
AGAACACCAG (SEQ ID NO: 480); GAACACCAGC (SEQ ID NO: 481);
AACACCAGCT (SEQ ID NO: 482); ACACCAGCTT (SEQ ID NO: 483);
CACCAGCTTG (SEQ ID NO: 484); ACCAGCTTGT (SEQ ID NO: 485);
CCAGCTTGTT (SEQ ID NO: 486); CAGCTTGTTA (SEQ ID NO: 487);
AGCTTGTTAC (SEQ ID NO: 488); GCTTGTTACA (SEQ ID NO: 489);
CTTGTTACAC (SEQ ID NO: 490); TTGTTACACC (SEQ ID NO: 491);
TGTTACACCC (SEQ ID NO: 492); GTTACACCCT (SEQ ID NO: 493);
TTACACCCTG (SEQ ID NO: 494); TACACCCTGT (SEQ ID NO: 495);
ACACCCTGTG (SEQ ID NO: 496); CACCCTGTGA (SEQ ID NO: 497);
ACCCTGTGAG (SEQ ID NO: 498); CCCTGTGAGC (SEQ ID NO: 499);
CCTGTGAGCC (SEQ ID NO: 500); CTGTGAGCCT (SEQ ID NO: 501);
TGTGAGCCTG (SEQ ID NO: 502); GTGAGCCTGC (SEQ ID NO: 503);
TGAGCCTGCA (SEQ ID NO: 504); GAGCCTGCAT (SEQ ID NO: 505);
AGCCTGCATG (SEQ ID NO: 506); GCCTGCATGG (SEQ ID NO: 507);
CCTGCATGGA (SEQ ID NO: 508); CTGCATGGAA (SEQ ID NO: 509);
TGCATGGAAT (SEQ ID NO: 510); GCATGGAATG (SEQ ID NO: 511);
CATGGAATGG (SEQ ID NO: 512); ATGGAATGGA (SEQ ID NO: 513);
TGGAATGGAT (SEQ ID NO: 514); GGAATGGATG (SEQ ID NO: 515);
GAATGGATGA (SEQ ID NO: 516); AATGGATGAC (SEQ ID NO: 517);
ATGGATGACC (SEQ ID NO: 518); TGGATGACCC (SEQ ID NO: 519);
GGATGACCCT (SEQ ID NO: 520); GATGACCCTG (SEQ ID NO: 521);
ATGACCCTGA (SEQ ID NO: 522); TGACCCTGAG (SEQ ID NO: 523);
GACCCTGAGA (SEQ ID NO: 524); ACCCTGAGAG (SEQ ID NO: 525);
CCCTGAGAGA (SEQ ID NO: 526); CCTGAGAGAG (SEQ ID NO: 527);
CTGAGAGAGA (SEQ ID NO: 528); TGAGAGAGAA (SEQ ID NO: 529);
GAGAGAGAAG (SEQ ID NO: 530); AGAGAGAAGT (SEQ ID NO: 531);
GAGAGAAGTG (SEQ ID NO: 532); AGAGAAGTGT (SEQ ID NO: 533);
GAGAAGTGTT (SEQ ID NO: 534); AGAAGTGTTA (SEQ ID NO: 535);
GAAGTGTTAG (SEQ ID NO: 536); AAGTGTTAGA (SEQ ID NO: 537);
AGTGTTAGAG (SEQ ID NO: 538); GTGTTAGAGT (SEQ ID NO: 539);
TGTTAGAGTG (SEQ ID NO: 540); GTTAGAGTGG (SEQ ID NO: 541);
TTAGAGTGGA (SEQ ID NO: 542); TAGAGTGGAG (SEQ ID NO: 543);
AGAGTGGAGG (SEQ ID NO: 544); GAGTGGAGGT (SEQ ID NO: 545);
AGTGGAGGTT (SEQ ID NO: 546); GTGGAGGTTT (SEQ ID NO: 547);
TGGAGGTTTG (SEQ ID NO: 548); GGAGGTTTGA (SEQ ID NO: 549);
GAGGTTTGAC (SEQ ID NO: 550); AGGTTTGACA (SEQ ID NO: 551);
GGTTTGACAG (SEQ ID NO: 552); GTTTGACAGC (SEQ ID NO: 553);
TTTGACAGCC (SEQ ID NO: 554); TTGACAGCCG (SEQ ID NO: 555);
TGACAGCCGC (SEQ ID NO: 556); GACAGCCGCC (SEQ ID NO: 557);
ACAGCCGCCT (SEQ ID NO: 558); CAGCCGCCTA (SEQ ID NO: 559);
AGCCGCCTAG (SEQ ID NO: 560); GCCGCCTAGC (SEQ ID NO: 561);
CCGCCTAGCA (SEQ ID NO: 562); CGCCTAGCAT (SEQ ID NO: 563);
GCCTAGCATT (SEQ ID NO: 564); CCTAGCATTT (SEQ ID NO: 565);
CTAGCATTTC (SEQ ID NO: 566); TAGCATTTCA (SEQ ID NO: 567);
AGCATTTCAT (SEQ ID NO: 568); GCATTTCATC (SEQ ID NO: 569);
CATTTCATCA (SEQ ID NO: 570); ATTTCATCAC (SEQ ID NO: 571);
TTTCATCACG (SEQ ID NO: 572); TTCATCACGT (SEQ ID NO: 573);
TCATCACGTG (SEQ ID NO: 574); CATCACGTGG (SEQ ID NO: 575);
ATCACGTGGC (SEQ ID NO: 576); TCACGTGGCC (SEQ ID NO: 577);
CACGTGGCCC (SEQ ID NO: 578); ACGTGGCCCG (SEQ ID NO: 579);
CGTGGCCCGA (SEQ ID NO: 580); GTGGCCCGAG (SEQ ID NO: 581);
TGGCCCGAGA (SEQ ID NO: 582); GGCCCGAGAG (SEQ ID NO: 583);
GCCCGAGAGC (SEQ ID NO: 584); CCCGAGAGCT (SEQ ID NO: 585);
CCGAGAGCTG (SEQ ID NO: 586); CGAGAGCTGC (SEQ ID NO: 587);
GAGAGCTGCA (SEQ ID NO: 588); AGAGCTGCAT (SEQ ID NO: 589);
GAGCTGCATC (SEQ ID NO: 590); AGCTGCATCC (SEQ ID NO: 591);
GCTGCATCCG (SEQ ID NO: 592); CTGCATCCGG (SEQ ID NO: 593);
TGCATCCGGA (SEQ ID NO: 594); GCATCCGGAG (SEQ ID NO: 595);
CATCCGGAGT (SEQ ID NO: 596); ATCCGGAGTA (SEQ ID NO: 597);
TCCGGAGTAC (SEQ ID NO: 598); CCGGAGTACT (SEQ ID NO: 599);
CGGAGTACTT (SEQ ID NO: 600); GGAGTACTTC (SEQ ID NO: 601);
GAGTACTTCA (SEQ ID NO: 602); AGTACTTCAA (SEQ ID NO: 603);
GTACTTCAAG (SEQ ID NO: 604); TACTTCAAGA (SEQ ID NO: 605);
ACTTCAAGAA (SEQ ID NO: 606); CTTCAAGAAC (SEQ ID NO: 607);
TTCAAGAACT (SEQ ID NO: 608); TCAAGAACTG (SEQ ID NO: 609);
CAAGAACTGC (SEQ ID NO: 610); AAGAACTGCT (SEQ ID NO: 611);
AGAACTGCTG (SEQ ID NO: 612); GAACTGCTGA (SEQ ID NO: 613).



15. A strain of HIV-1 according to claim 14 wherein said deletion encompasses at
least one of the following decanucleotide deletions from the nef gene:



ACCAGCTTGT [SEQ ID NO: 803] CCAGCTTGTT [SEQ ID NO: 804]
CAGCTTGTTA [SEQ ID NO: 805] AGCTTGTTAC [SEQ ID NO: 806]
GCTTGTTACA [SEQ ID NO: 807] CTTGTTACAC [SEQ ID NO: 808]
TTGTTACACC [SEQ ID NO: 809] TGTTACACCC [SEQ ID NO: 810]
GTTACACCCT [SEQ ID NO: 811] TTACACCCTG [SEQ ID NO: 812]
TACACCCTGT [SEQ ID NO: 813] ACACCCTGTG [SEQ ID NO: 814]
CACCCTGTGA [SEQ ID NO: 815] ACCCTGTGAG [SEQ ID NO: 816]
CCCTGTGAGC [SEQ ID NO: 817] CCTGTGAGCC [SEQ ID NO: 818]
CTGTGAGCCT [SEQ ID NO: 819] TGTGAGCCTG [SEQ ID NO: 820]
GTGAGCCTGC [SEQ ID NO: 821] TGAGCCTGCA [SEQ ID NO: 822]
GAGCCTGCAT [SEQ ID NO: 823] AGCCTGCATG [SEQ ID NO: 824]
GCCTGCATGG [SEQ ID NO: 825] CCTGCATGGA [SEQ ID NO: 826]
CTGCATGGAA [SEQ ID NO: 827] TGCATGGAAT [SEQ ID NO: 828]
GCATGGAATG [SEQ ID NO: 829] CATGGAATGG [SEQ ID NO: 830]
ATGGAATGGA [SEQ ID NO: 831] TGGAATGGAT [SEQ ID NO: 832]
GGAATGGATG [SEQ ID NO: 833] GAATGGATGA [SEQ ID NO: 834]
AATGGATGAC [SEQ ID NO: 835] ATGGATGACC [SEQ ID NO: 836]
TGGATGACCC [SEQ ID NO: 837] GGATGACCCT [SEQ ID NO: 838]
GATGACCCTG [SEQ ID NO: 839] ATGACCCTGA [SEQ ID NO: 840]
TGACCCTGAG [SEQ ID NO: 841].

16. A strain of HIV-1 according to claim 9 wherein said deletion encompasses at
least one of the following decanucleotides from the LTR region of HIV-1 NL4-3:



GCTTTTTGCC (SEQ ID NO: 652); CTTTTTGCCT (SEQ ID NO: 653);
TTTTTGCCTG (SEQ ID NO: 654); TTTTGCCTGT (SEQ ID NO: 655);
TTTGCCTGTA (SEQ ID NO: 656); TTGCCTGTAC (SEQ ID NO: 657);
TGCCTGTACT (SEQ ID NO: 658); GCCTGTACTG (SEQ ID NO: 659);
CCTGTACTGG (SEQ ID NO: 660); CTGTACTGGG (SEQ ID NO: 661);
TGTACTGGGT (SEQ ID NO: 662); GTACTGGGTC (SEQ ID NO: 663);
TACTGGGTCT (SEQ ID NO: 664); ACTGGGTCTC (SEQ ID NO: 665);
CTGGGTCTCT (SEQ ID NO: 666); TGGGTCTCTC (SEQ ID NO: 667);
GGGTCTCTCT (SEQ ID NO: 668); GGTCTCTCTG (SEQ ID NO: 669);
GTCTCTCTGG (SEQ ID NO: 670); TCTCTCTGGT (SEQ ID NO: 671);
CTCTCTGGTT (SEQ ID NO: 672); TCTCTGGTTA (SEQ ID NO: 673);
CTCTGGTTAG (SEQ ID NO: 674); TCTCTGGTTA (SEQ ID NO: 675);
CTGGTTAGAC (SEQ ID NO: 676); TGGTTAGACC (SEQ ID NO: 677);
GGTTAGACCA (SEQ ID NO: 678); GTTAGACCAG (SEQ ID NO: 679);
TTAGACCAGA (SEQ ID NO: 680); TAGACCAGAT (SEQ ID NO: 681);
AGACCAGATC (SEQ ID NO: 682); GACCAGATCT (SEQ ID NO: 683);
ACCAGATCTG (SEQ ID NO: 684); CCAGATCTGA (SEQ ID NO: 685);
CAGATCTGAG (SEQ ID NO: 686); AGATCTGAGC (SEQ ID NO: 687);
GATCTGAGCC (SEQ ID NO: 688); ATCTGAGCCT (SEQ ID NO: 689);
TCTGAGCCTG (SEQ ID NO: 690); CTGAGCCTGG (SEQ ID NO: 691);
TGAGCCTGGG (SEQ ID NO: 692); GAGCCTGGGA (SEQ ID NO: 693);
AGCCTGGGAG (SEQ ID NO: 694); GCCTGGGAGC (SEQ ID NO: 695);
CCTGGGAGCT (SEQ ID NO: 696); CTGGGAGCTC (SEQ ID NO: 697);
TGGGAGCTCT (SEQ ID NO: 698); GGGAGCTCTC (SEQ ID NO: 699);
GGAGCTCTCT (SEQ ID NO: 700); GAGCTCTCTG (SEQ ID NO: 701);
AGCTCTCTGG (SEQ ID NO: 702); GCTCTCTGGC (SEQ ID NO: 703);
CTCTCTGGCT (SEQ ID NO: 704); TCTCTGGCTA (SEQ ID NO: 705);
CTCTGGCTAA (SEQ ID NO: 706); TCTGGCTAAC (SEQ ID NO: 707);
CTGGCTAACT (SEQ ID NO: 708); TGGCTAACTA (SEQ ID NO: 709);
GGCTAACTAG (SEQ ID NO: 710); GCTAACTAGG (SEQ ID NO: 711);
CTAACTAGGG (SEQ ID NO: 712); TAACTAGGGA (SEQ ID NO: 713);
AACTAGGGAA (SEQ ID NO: 714); ACTAGGGAAC (SEQ ID NO: 715);
CTAGGGAACC (SEQ ID NO: 716); TAGGGAACCC (SEQ ID NO: 717);
AGGGAACCCA (SEQ ID NO: 718); GGGAACCCAC (SEQ ID NO: 719);
GGAACCCACT (SEQ ID NO: 720); GAACCCACTG (SEQ ID NO: 721);
AACCCACTGC (SEQ ID NO: 722); ACCCACTGCT (SEQ ID NO: 723);
CCCACTGCTT (SEQ ID NO: 724); CCACTGCTTA (SEQ ID NO: 725);
CACTGCTTAA (SEQ ID NO: 726); ACTGCTTAAG (SEQ ID NO: 727);
CTGCTTAAGC (SEQ ID NO: 728); TGCTTAAGCC (SEQ ID NO: 729);
GCTTAAGCCT (SEQ ID NO: 730); CTTAAGCCTC (SEQ ID NO: 731);
TTAAGCCTCA (SEQ ID NO: 732); TAAGCCTCAA (SEQ ID NO: 733);
AAGCCTCAAT (SEQ ID NO: 734); AGCCTCAATA (SEQ ID NO: 735);
GCCTCAATAA (SEQ ID NO: 736); CCTCAATAAA (SEQ ID NO: 737);
CTCAATAAAG (SEQ ID NO: 738); TCAATAAAGC (SEQ ID NO: 739);
CAATAAAGCT (SEQ ID NO: 740); AATAAAGCTT (SEQ ID NO: 741);
ATAAAGCTTG (SEQ ID NO: 742); TAAAGCTTGC (SEQ ID NO: 743);
AAAGCTTGCC (SEQ ID NO: 744); AAGCTTGCCT (SEQ ID NO: 745);
AGCTTGCCTT (SEQ ID NO: 746); GCTTGCCTTG (SEQ ID NO: 747);
CTTGCCTTGA (SEQ ID NO: 748); TTGCCTTGAG (SEQ ID NO: 749);
TGCCTTGAGT (SEQ ID NO: 750); GCCTTGAGTG (SEQ ID NO: 751);
CCTTGAGTGC (SEQ ID NO: 752); CTTGAGTGCT (SEQ ID NO: 753);
TTGAGTGCTT (SEQ ID NO: 754); TGAGTGCTTC (SEQ ID NO: 755);
GAGTGCTTCA (SEQ ID NO: 756); AGTGCTTCAA (SEQ ID NO: 757);
GTGCTTCAAG (SEQ ID NO: 758); TGCTTCAAGT (SEQ ID NO: 759);
GCTTCAAGTA (SEQ ID NO: 760); CTTCAAGTAG (SEQ ID NO: 761);
TTCAAGTAGT (SEQ ID NO: 762); TCAAGTAGTG (SEQ ID NO: 763);
CAAGTAGTGT (SEQ ID NO: 764); AAGTAGTGTG (SEQ ID NO: 765);
AGTAGTGTGT (SEQ ID NO: 766); GTAGTGTGTG (SEQ ID NO: 767);
TAGTGTGTGC (SEQ ID NO: 768); AGTGTGTGCC (SEQ ID NO: 769);
GTGTGTGCCC (SEQ ID NO: 770); TGTGTGCCCG (SEQ ID NO: 771);
GTGTGCCCGT (SEQ ID NO: 772); TGTGCCCGTC (SEQ ID NO: 773);
GTGCCCGTCT (SEQ ID NO: 774); TGCCCGTCTG (SEQ ID NO: 775);
GCCCGTCTGT (SEQ ID NO: 776); CCCGTCTGTT (SEQ ID NO: 777);
CCGTCTGTTG (SEQ ID NO: 778); CGTCTGTTGT (SEQ ID NO: 779);
GTCTGTTGTG (SEQ ID NO: 780); TCTGTTGTGT (SEQ ID NO: 781);
CTGTTGTGTG (SEQ ID NO: 782); TGTTGTGTGA (SEQ ID NO: 783);
GTTGTGTGAC (SEQ ID NO: 784); TTGTGTGACT (SEQ ID NO: 785);
TGTGTGACTC (SEQ ID NO: 786); GTGTGACTCT (SEQ ID NO: 787);
TGTGTGACTC (SEQ ID NO: 788); GTGTGACTCT (SEQ ID NO: 789);
TGTGACTCTG (SEQ ID NO: 790); GTGACTCTGG (SEQ ID NO: 791);
TGACTCTGGT (SEQ ID NO: 792); GACTCTGGTA (SEQ ID NO: 793);
ACTCTGGTAA (SEQ ID NO: 794); CTCTGGTAAC (SEQ ID NO: 795);
TCTGGTAACT (SEQ ID NO: 796); CTGGTAACTA (SEQ ID NO: 797);
TGGTAACTAG (SEQ ID NO: 798); GGTAACTAGA (SEQ ID NO: 799).

17. An isolated strain of HIV-1 or a biological source thereof, said HIV-1 having the
following characteristics:

(i) is substantially non-pathogenic in human subjects; and

(ii) carries a modified nef gene which encodes a nef gene product
substantially immunologically non-interactive with antibodies to amino
acids 162 to 177 of Nef in wild-type HIV-1.

18. An isolated strain of HIV-1 according to claim 17 wherein said nef gene product
is interactive with antibodies to wild-type HIV-1 Nef but is substantially non-interactive
with antibodies to amino acids 162 to 177 of wild-type HIV-1 Nef protein.

19. An isolated strain of HIV-1 according to claim 17 comprising a mutation in its
genome such that amino acids 162 to 177 of wild-type HIV-1 Nef are substantially not
represented in a Nef protein or derivative thereof produced by said isolated HIV-1 strain,
or are represented in an amount insufficient to induce an immune response to that region
of Nef.

20. An isolated strain of HIV-1 according to claim 17 wherein said strain of HIV-1
carries a mutation in one or more of nucleotides 9271 to 9317 of HIV-1 NL4-3 or in a
functionally equivalent region in another pathogenic strain of HIV-1.

21. An isolated non-pathogenic strain of HIV-1 comprising a genome which is
substantially incapable of hybridizing under medium stringency conditions to a nucleic
acid molecule comprising a sequence of nucleotides which encodes all or part of amino
acids 162 to 177 of wild-type HIV-1 Nef.

22. A non-pathogenic HIV-1 isolate which: 

(i) induces an immune response in a human or primate subject;

(ii) does not substantially produce a proliferative response or cytokine
production to a mitogen, alloantigen and/or recall antigen relative to a
healthy, non-infected subject; and

(iii) is substantially incapable of inducing an immune response to amino acids
162 to 177 of wild-type HIV-1 Nef protein.

23. A viral isolate which: 

(i) is interactive to antibodies to a glycoprotein from HIV-1 selected from
gp41-45, gp120 and gp160;

(ii) is substantially non-pathogenic in human subjects; and

(iii) carries a deletion mutation of at least ten nucleotides in a region
corresponding to all or part of amino acids 162 to 177 encoded by the nef
gene of a pathogenic strain of HIV-1.

24. A viral isolate which:

(i) is reactive to antibodies to a glycoprotein from HIV-1 selected from
gp41-45, gp120 and gp160;

(ii) carries a deletion of at least ten nucleotides in a region corresponding to
the nef gene in HIV-1 NL4-3; and

wherein said deletion encompasses one or more of the following decanucleotides from
the nef gene of HIV-1 NL4-3 or corresponding sequences from another pathogenic strain
of HIV-1:

ACCAGCTTGT [SEQ ID NO:4] CCAGCTTGTT [SEQ ID NO:5]
CAGCTTGTTA [SEQ ID NO:6] AGCTTGTTAC [SEQ ID NO:7]
GCTTGTTACA [SEQ ID NO:8] CTTGTTACAC [SEQ ID NO:9]
TTGTTACACC [SEQ ID NO:10] TGTTACACCC [SEQ ID NO:11]
GTTACACCCT [SEQ ID NO:12] TTACACCCTG [SEQ ID NO:13]
TACACCCTGT [SEQ ID NO:14] ACACCCTGTG [SEQ ID NO:15]
CACCCTGTGA [SEQ ID NO:16] ACCCTGTGAG [SEQ ID NO:17]
CCCTGTGAGC [SEQ ID NO:18] CCTGTGAGCC [SEQ ID NO:19]
CTGTGAGCCT [SEQ ID NO:20] TGTGAGCCTG [SEQ ID NO:21]
GTGAGCCTGC [SEQ ID NO:22] TGAGCCTGCA [SEQ ID NO:23]
GAGCCTGCAT [SEQ ID NO:24] AGCCTGCATG [SEQ ID NO:25]
GCCTGCATGG [SEQ ID NO:26] CCTGCATGGA [SEQ ID NO:27]
CTGCATGGAA [SEQ ID NO:28] TGCATGGAAT [SEQ ID NO:29]
GCATGGAATG [SEQ ID NO:30] CATGGAATGG [SEQ ID NO:31]
ATGGAATGGA [SEQ ID NO:32] TGGAATGGAT [SEQ ID NO:33]
GGAATGGATG [SEQ ID NO:34] GAATGGATGA [SEQ ID NO:35]
AATGGATGAC [SEQ ID NO:36] ATGGATGACC [SEQ ID NO:37]
TGGATGACCC [SEQ ID NO:38] GGATGACCCT [SEQ ID NO:39]
GATGACCCTG [SEQ ID NO:40] ATGACCCTGA [SEQ ID NO:41]
TGACCCTGAG [SEQ ID NO:42]

25. An isolated strain of HIV-1 having the identifying characteristics of HIV-1 isolate
ECACC Accession No. V94101706 or HIV-1 isolate ECACC Accession No.
V941031169 or ECACC Accession No. V95031022.

26. An isolated strain of HIV-1 which is reactive to antibodies to a glycoprotein of
HIV-1, is capable of inducing an immune response to at least one of gag, pol and/or env
and which is incapable of directing synthesis of a nef gene product or a full length nef
gene product.

27. An isolated strain of HIV-1 according to claim 26 wherein the glycoprotein is
at least one of gp41-45, gp120 and/or gp160.

28. A nucleic acid molecule or a part, fragment or derivative thereof from an HIV-1
isolate defined in any one of claims 1 to 26.



29. A molecular infectious clone comprising a nucleic acid molecule according to
claim 28.



30. A method for inhibiting or reducing productive infection of an individual by a
pathogenic strain of HIV-1, said method comprising administering to a subject a
non-pathogenic isolate of HIV-1 in an amount effective to infect target cells and to generate
target cells carrying DNA derived from said non-pathogenic HIV-1.

31. A method according to claim 30 wherein said isolate is capable of stimulating
an immune response in a human or primate against at least one glycoprotein on HIV-1
whilst not substantially reducing proliferative responses and cytokine production to a
mitogen in said human or primate compared to a healthy, non-infected human or primate
subject.

32. A method according to claim 31 wherein said isolate carries one or more
mutations in its genome resulting in the inability to direct synthesis of at least one
polypeptide or protein from a pathogenic strain of HIV-1.

33. A method according to claim 31 wherein said isolate carries one or more
mutations in its genome resulting in said genome directing synthesis of a truncated form
of a polypeptide or protein from a pathogenic strain of HIV-1.


34. A method according to claim 32 or 33 wherein said HIV-1 isolate carries a
mutation in its nef gene and/or long terminal repeat (LTR) region or in a functionally
equivalent location in the HIV-1 genome.

35. A method according to claim 34 wherein said HIV-1 isolate is interactive to
antibodies to gp41-45, gp120 and/or gp160 of HIV-1 and carries a deletion of at least
ten contiguous nucleotides in a region corresponding to the nef gene and/or LTR region
of pathogenic HIV-1.

36. A method according to claim 35 wherein said HIV-1 isolate encodes a modified
Nef protein without amino acids 162 to 177 of wild-type HIV-1 NL4-3 Nef.

37. A method according to claim 35 or 34 wherein said HIV-1 isolate is capable of
inducing an immune response to at least one of gag, pol and/or env.

38. A method according to claim 37 wherein said HIV-1 isolate carries a deletion in
its genome of at least 10 nucleotides within the region from nucleotide 8787 to
nucleotide 9709 using the nucleotide numbering of HIV-1 NL4-3.

39. A method according to claim 38 wherein said isolate carries a deletion in its
genome of at least 10 nucleotides within a region selected from the list consisting of:
nucleotide (i) 8830-8862;

(ii) 9009-9035;
(iii) 9019-9029; and




(iv) 9033-9049. 

40. A method according to claim 38 wherein said isolate carries a deletion in its
genome of at least 10 nucleotides within a region selected from the list consisting of:
nucleotide (v) 9281-9371;

(vi) 9281-9362;

(vii) 9105-9224; and

(viii) 9271-9370.

41. A method according to claim 38 wherein said isolate carries a deletion in its
genome of at least 10 nucleotides within a region selected from the list consisting of:
nucleotide (ix) 8882-8928;

(x) 8850-9006;

(xi) 8792-9041; and
(xii) 9112-9204.

42. A method according to claim 38 wherein said isolate carries a deletion in its
genome of at least 10 nucleotides within a region selected from the list consisting of:
nucleotide (xiii) 9105-9224;

(xiv) 9389-9395; and

(xv) 9281-9366.
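
As a reading aid only, the coordinate test implied by claims 38 to 42 can be sketched in Python. The sketch assumes that deletion coordinates are inclusive, that they use the HIV-1 NL4-3 numbering, and that "within" means the deletion lies wholly inside the region; the function name `is_qualifying_deletion` and the constant `CLAIM_38_REGION` are hypothetical and do not appear in the specification.

```python
# Hypothetical illustration; names and the inclusive-coordinate reading are assumptions.
CLAIM_38_REGION = (8787, 9709)  # nucleotide range recited in claim 38


def is_qualifying_deletion(start, end, region, min_length=10):
    """Return True if the deletion start..end (inclusive) spans at least
    `min_length` nucleotides and lies wholly inside `region`."""
    region_start, region_end = region
    length = end - start + 1  # inclusive coordinates
    return length >= min_length and region_start <= start and end <= region_end


# A deletion spanning 9281-9371 falls inside the claim 38 region.
print(is_qualifying_deletion(9281, 9371, CLAIM_38_REGION))
```

Under a different reading of "within" (for example, a deletion that merely overlaps a region), the comparison on the last line of the function would need to change.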

43. A method according to claim 38 wherein said deletion encompasses at least one
of the following decanucleotides from the nef gene of HIV-1 NL4-3:
ATGGGTGGCA (SEQ ID NO:2); TGGGTGGCAA (SEQ ID NO:3);
GGGTGGCAAG (SEQ ID NO:4); GGTGGCAAGT (SEQ ID NO:5);
GTGGCAAGTG (SEQ ID NO:6); TGGCAAGTGG (SEQ ID NO:7);
GGCAAGTGGT (SEQ ID NO:8); GCAAGTGGTC (SEQ ID NO:9);
CAAGTGGTCA (SEQ ID NO:10); AAGTGGTCAA (SEQ ID NO:11);
AGTGGTCAAA (SEQ ID NO:12); GTGGTCAAAA (SEQ ID NO:13);
TGGTCAAAAA (SEQ ID NO:14); GGTCAAAAAG (SEQ ID NO:15);
GTCAAAAAGT (SEQ ID NO:16); TCAAAAAGTA (SEQ ID NO:17);
CAAAAAGTAG (SEQ ID NO:18); AAAAAGTAGT (SEQ ID NO:19);
AAAAGTAGTG (SEQ ID NO:20); AAAGTAGTGT (SEQ ID NO:21);
AAGTAGTGTG (SEQ ID NO:22); AGTAGTGTGA (SEQ ID NO:23);
GTAGTGTGAT (SEQ ID NO:24); TAGTGTGATT (SEQ ID NO:25);
AGTGTGATTG (SEQ ID NO:26); GTGTGATTGG (SEQ ID NO:27);
TGTGATTGGA (SEQ ID NO:28); GTGATTGGAT (SEQ ID NO:29);
TGATTGGATG (SEQ ID NO:30); GATTGGATGG (SEQ ID NO:31);
ATTGGATGGC (SEQ ID NO:32); TTGGATGGCC (SEQ ID NO:33);
TGGATGGCCT (SEQ ID NO:34); GGATGGCCTG (SEQ ID NO:35);
GATGGCCTGC (SEQ ID NO:36); ATGGCCTGCT (SEQ ID NO:37);
TGGCCTGCTG (SEQ ID NO:38); GGCCTGCTGT (SEQ ID NO:39);
GCCTGCTGTA (SEQ ID NO:40); CCTGCTGTAA (SEQ ID NO:41);
CTGCTGTAAG (SEQ ID NO:42); TGCTGTAAGG (SEQ ID NO:43);
GCTGTAAGGG (SEQ ID NO:44); CTGTAAGGGA (SEQ ID NO:45);
TGTAAGGGAA (SEQ ID NO:46); GTAAGGGAAA (SEQ ID NO:47);
TAAGGGAAAG (SEQ ID NO:48); AAGGGAAAGA (SEQ ID NO:49);
AGGGAAAGAA (SEQ ID NO:50); GGGAAAGAAT (SEQ ID NO:51);
GGAAAGAATG (SEQ ID NO:52); GAAAGAATGA (SEQ ID NO:53);
AAAGAATGAG (SEQ ID NO:54); AAGAATGAGA (SEQ ID NO:55);
AGAATGAGAC (SEQ ID NO:56); GAATGAGACG (SEQ ID NO:57);
AATGAGACGA (SEQ ID NO:58); ATGAGACGAG (SEQ ID NO:59);
TGAGACGAGC (SEQ ID NO:60); GAGACGAGCT (SEQ ID NO:61);
AGACGAGCTG (SEQ ID NO:62); GACGAGCTGA (SEQ ID NO:63);
ACGAGCTGAG (SEQ ID NO:64); CGAGCTGAGC (SEQ ID NO:65);
GAGCTGAGCC (SEQ ID NO:66); AGCTGAGCCA (SEQ ID NO:67);
GCTGAGCCAG (SEQ ID NO:68); CTGAGCCAGC (SEQ ID NO:69);
TGAGCCAGCA (SEQ ID NO:70); GAGCCAGCAG (SEQ ID NO:71);
AGCCAGCAGC (SEQ ID NO:72); GCCAGCAGCA (SEQ ID NO:73);
CCAGCAGCAG (SEQ ID NO:74); CAGCAGCAGA (SEQ ID NO:75);
AGCAGCAGAT (SEQ ID NO:76); GCAGCAGATG (SEQ ID NO:77);
CAGCAGATGG (SEQ ID NO:78); AGCAGATGGG (SEQ ID NO:79);
GCAGATGGGG (SEQ ID NO:80); CAGATGGGGT (SEQ ID NO:81);
AGATGGGGTG (SEQ ID NO:82); GATGGGGTGG (SEQ ID NO:83);
ATGGGGTGGG (SEQ ID NO:84); TGGGGTGGGA (SEQ ID NO:85);
GGGGTGGGAG (SEQ ID NO:86); GGGTGGGAGC (SEQ ID NO:87);
GGTGGGAGCA (SEQ ID NO:88); GTGGGAGCAG (SEQ ID NO:89);
TGGGAGCAGT (SEQ ID NO:90); GGGAGCAGTA (SEQ ID NO:91);
GGAGCAGTAT (SEQ ID NO:92); GAGCAGTATC (SEQ ID NO:93);
AGCAGTATCT (SEQ ID NO:94); GCAGTATCTC (SEQ ID NO:95);
CAGTATCTCG (SEQ ID NO:96); AGTATCTCGA (SEQ ID NO:97);
GTATCTCGAG (SEQ ID NO:98); TATCTCGAGA (SEQ ID NO:99);
ATCTCGAGAC (SEQ ID NO:100); TCTCGAGACC (SEQ ID NO:101);
CTCGAGACCT (SEQ ID NO:102); TCGAGACCTA (SEQ ID NO:103);
CGAGACCTAG (SEQ ID NO:104); GAGACCTAGA (SEQ ID NO:105);
AGACCTAGAA (SEQ ID NO:106); GACCTAGAAA (SEQ ID NO:107);
ACCTAGAAAA (SEQ ID NO:108); CCTAGAAAAA (SEQ ID NO:109);
CTAGAAAAAC (SEQ ID NO:110); TAGAAAAACA (SEQ ID NO:111);
AGAAAAACAT (SEQ ID NO:112); GAAAAACATG (SEQ ID NO:113);
AAAAACATGG (SEQ ID NO:114); AAAACATGGA (SEQ ID NO:115);
AAACATGGAG (SEQ ID NO:116); AACATGGAGC (SEQ ID NO:117);
ACATGGAGCA (SEQ ID NO:118); CATGGAGCAA (SEQ ID NO:119);
ATGGAGCAAT (SEQ ID NO:120); TGGAGCAATC (SEQ ID NO:121);
GGAGCAATCA (SEQ ID NO:122); GAGCAATCAC (SEQ ID NO:123);
AGCAATCACA (SEQ ID NO:124); GCAATCACAA (SEQ ID NO:125);
CAATCACAAG (SEQ ID NO:126); AATCACAAGT (SEQ ID NO:127);
ATCACAAGTA (SEQ ID NO:128); TCACAAGTAG (SEQ ID NO:129);
CACAAGTAGC (SEQ ID NO:130); ACAAGTAGCA (SEQ ID NO:131);
CAAGTAGCAA (SEQ ID NO:132); AAGTAGCAAT (SEQ ID NO:133);
AGTAGCAATA (SEQ ID NO:134); GTAGCAATAC (SEQ ID NO:135);
TAGCAATACA (SEQ ID NO:136); AGCAATACAG (SEQ ID NO:137);
GCAATACAGC (SEQ ID NO:138); CAATACAGCA (SEQ ID NO:139);
AATACAGCAG (SEQ ID NO:140); ATACAGCAGC (SEQ ID NO:141);
TACAGCAGCT (SEQ ID NO:142); ACAGCAGCTA (SEQ ID NO:143);
CAGCAGCTAA (SEQ ID NO:144); AGCAGCTAAC (SEQ ID NO:145);
GCAGCTAACA (SEQ ID NO:146); CAGCTAACAA (SEQ ID NO:147);
AGCTAACAAT (SEQ ID NO:148); GCTAACAATG (SEQ ID NO:149);
CTAACAATGC (SEQ ID NO:150); TAACAATGCT (SEQ ID NO:151);
AACAATGCTG (SEQ ID NO:152); ACAATGCTGC (SEQ ID NO:153);
CAATGCTGCT (SEQ ID NO:154); AATGCTGCTT (SEQ ID NO:155);
ATGCTGCTTG (SEQ ID NO:156); TGCTGCTTGT (SEQ ID NO:157);
GCTGCTTGTG (SEQ ID NO:158); CTGCTTGTGC (SEQ ID NO:159);
TGCTTGTGCC (SEQ ID NO:160); GCTTGTGCCT (SEQ ID NO:161);
CTTGTGCCTG (SEQ ID NO:162); TTGTGCCTGG (SEQ ID NO:163);
TGTGCCTGGC (SEQ ID NO:164); GTGCCTGGCT (SEQ ID NO:165);
TGCCTGGCTA (SEQ ID NO:166); GCCTGGCTAG (SEQ ID NO:167);
CCTGGCTAGA (SEQ ID NO:168); CTGGCTAGAA (SEQ ID NO:169);
TGGCTAGAAG (SEQ ID NO:170); GGCTAGAAGC (SEQ ID NO:171);
GCTAGAAGCA (SEQ ID NO:172); CTAGAAGCAC (SEQ ID NO:173);
TAGAAGCACA (SEQ ID NO:174); AGAAGCACAA (SEQ ID NO:175);
GAAGCACAAG (SEQ ID NO:176); AAGCACAAGA (SEQ ID NO:177);
AGCACAAGAG (SEQ ID NO:178); GCACAAGAGG (SEQ ID NO:179);
CACAAGAGGA (SEQ ID NO:180); ACAAGAGGAG (SEQ ID NO:181);
CAAGAGGAGG (SEQ ID NO:182); AAGAGGAGGA (SEQ ID NO:183);
AGAGGAGGAA (SEQ ID NO:184); GAGGAGGAAG (SEQ ID NO:185);
AGGAGGAAGA (SEQ ID NO:186); GGAGGAAGAG (SEQ ID NO:187);
GAGGAAGAGG (SEQ ID NO:188); AGGAAGAGGT (SEQ ID NO:189);
GGAAGAGGTG (SEQ ID NO:190); GAAGAGGTGG (SEQ ID NO:191);
AAGAGGTGGG (SEQ ID NO:192); AGAGGTGGGT (SEQ ID NO:193);
GAGGTGGGTT (SEQ ID NO:194); AGGTGGGTTT (SEQ ID NO:195);
GGTGGGTTTT (SEQ ID NO:196); GTGGGTTTTC (SEQ ID NO:197);
TGGGTTTTCC (SEQ ID NO:198); GGGTTTTCCA (SEQ ID NO:199);
GGTTTTCCAG (SEQ ID NO:200); GTTTTCCAGT (SEQ ID NO:201);
TTTTCCAGTC (SEQ ID NO:202); TTTCCAGTCA (SEQ ID NO:203);
TTCCAGTCAC (SEQ ID NO:204); TCCAGTCACA (SEQ ID NO:205);
CCAGTCACAC (SEQ ID NO:206); CAGTCACACC (SEQ ID NO:207);
AGTCACACCT (SEQ ID NO:208); GTCACACCTC (SEQ ID NO:209);
TCACACCTCA (SEQ ID NO:210); CACACCTCAG (SEQ ID NO:211);
ACACCTCAGG (SEQ ID NO:212); CACCTCAGGT (SEQ ID NO:213);
ACCTCAGGTA (SEQ ID NO:214); CCTCAGGTAC (SEQ ID NO:215);
CTCAGGTACC (SEQ ID NO:216); TCAGGTACCT (SEQ ID NO:217);
CAGGTACCTT (SEQ ID NO:218); AGGTACCTTT (SEQ ID NO:219);
GGTACCTTTA (SEQ ID NO:220); GTACCTTTAA (SEQ ID NO:221);
TACCTTTAAG (SEQ ID NO:222); ACCTTTAAGA (SEQ ID NO:223);
CCTTTAAGAC (SEQ ID NO:224); CTTTAAGACC (SEQ ID NO:225);
TTTAAGACCA (SEQ ID NO:226); TTAAGACCAA (SEQ ID NO:227);
TAAGACCAAT (SEQ ID NO:228); AAGACCAATG (SEQ ID NO:229);
AGACCAATGA (SEQ ID NO:230); GACCAATGAC (SEQ ID NO:231);
ACCAATGACT (SEQ ID NO:232); CCAATGACTT (SEQ ID NO:233);
CAATGACTTA (SEQ ID NO:234); AATGACTTAC (SEQ ID NO:235);
ATGACTTACA (SEQ ID NO:236); TGACTTACAA (SEQ ID NO:237);
GACTTACAAG (SEQ ID NO:238); ACTTACAAGG (SEQ ID NO:239);
CTTACAAGGC (SEQ ID NO:240); TTACAAGGCA (SEQ ID NO:241);
TACAAGGCAG (SEQ ID NO:242); ACAAGGCAGC (SEQ ID NO:243);
CAAGGCAGCT (SEQ ID NO:244); AAGGCAGCTG (SEQ ID NO:245);
AGGCAGCTGT (SEQ ID NO:246); GGCAGCTGTA (SEQ ID NO:247);
GCAGCTGTAG (SEQ ID NO:248); CAGCTGTAGA (SEQ ID NO:249);
AGCTGTAGAT (SEQ ID NO:250); GCTGTAGATC (SEQ ID NO:251);
CTGTAGATCT (SEQ ID NO:252); TGTAGATCTT (SEQ ID NO:253);
GTAGATCTTA (SEQ ID NO:254); TAGATCTTAG (SEQ ID NO:255);
AGATCTTAGC (SEQ ID NO:256); GATCTTAGCC (SEQ ID NO:257);
ATCTTAGCCA (SEQ ID NO:258); TCTTAGCCAC (SEQ ID NO:259);
CTTAGCCACT (SEQ ID NO:260); TTAGCCACTT (SEQ ID NO:261);
TAGCCACTTT (SEQ ID NO:262); AGCCACTTTT (SEQ ID NO:263);
GCCACTTTTT (SEQ ID NO:264); CCACTTTTTA (SEQ ID NO:265);
CACTTTTTAA (SEQ ID NO:266); ACTTTTTAAA (SEQ ID NO:267);
CTTTTTAAAA (SEQ ID NO:268); TTTTTAAAAG (SEQ ID NO:269);
TTTTAAAAGA (SEQ ID NO:270); TTTAAAAGAA (SEQ ID NO:271);
TTAAAAGAAA (SEQ ID NO:272); TAAAAGAAAA (SEQ ID NO:273);
AAAAGAAAAG (SEQ ID NO:274); AAAGAAAAGG (SEQ ID NO:275);
AAGAAAAGGG (SEQ ID NO:276); AGAAAAGGGG (SEQ ID NO:277);
GAAAAGGGGG (SEQ ID NO:278); AAAAGGGGGG (SEQ ID NO:279);
AAAGGGGGGA (SEQ ID NO:280); AAGGGGGGAC (SEQ ID NO:281);
AGGGGGGACT (SEQ ID NO:282); GGGGGGACTG (SEQ ID NO:283);
GGGGGACTGG (SEQ ID NO:284); GGGGACTGGA (SEQ ID NO:285);
GGGACTGGAA (SEQ ID NO:286); GGACTGGAAG (SEQ ID NO:287);
GACTGGAAGG (SEQ ID NO:288); ACTGGAAGGG (SEQ ID NO:289);
CTGGAAGGGC (SEQ ID NO:290); TGGAAGGGCT (SEQ ID NO:291);
GGAAGGGCTA (SEQ ID NO:292); GAAGGGCTAA (SEQ ID NO:293);
AAGGGCTAAT (SEQ ID NO:294); AGGGCTAATT (SEQ ID NO:295);
GGGCTAATTC (SEQ ID NO:296); GGCTAATTCA (SEQ ID NO:297);
GCTAATTCAC (SEQ ID NO:298); CTAATTCACT (SEQ ID NO:299);
TAATTCACTC (SEQ ID NO:300); AATTCACTCC (SEQ ID NO:301);
ATTCACTCCC (SEQ ID NO:302); TTCACTCCCA (SEQ ID NO:303);
TCACTCCCAA (SEQ ID NO:304); CACTCCCAAA (SEQ ID NO:305);
ACTCCCAAAG (SEQ ID NO:306); CTCCCAAAGA (SEQ ID NO:307);
TCCCAAAGAA (SEQ ID NO:308); CCCAAAGAAG (SEQ ID NO:309);
CCAAAGAAGA (SEQ ID NO:310); CAAAGAAGAC (SEQ ID NO:311);
AAAGAAGACA (SEQ ID NO:312); AAGAAGACAA (SEQ ID NO:313);
AGAAGACAAG (SEQ ID NO:314); GAAGACAAGA (SEQ ID NO:315);
AAGACAAGAT (SEQ ID NO:316); AGACAAGATA (SEQ ID NO:317);
GACAAGATAT (SEQ ID NO:318); ACAAGATATC (SEQ ID NO:319);
CAAGATATCC (SEQ ID NO:320); AAGATATCCT (SEQ ID NO:321);
AGATATCCTT (SEQ ID NO:322); GATATCCTTG (SEQ ID NO:323);
ATATCCTTGA (SEQ ID NO:324); TATCCTTGAT (SEQ ID NO:325);
ATCCTTGATC (SEQ ID NO:326); TCCTTGATCT (SEQ ID NO:327);
CCTTGATCTG (SEQ ID NO:328); CTTGATCTGT (SEQ ID NO:329);
TTGATCTGTG (SEQ ID NO:330); TGATCTGTGG (SEQ ID NO:331);
GATCTGTGGA (SEQ ID NO:332); ATCTGTGGAT (SEQ ID NO:333);
TCTGTGGATC (SEQ ID NO:334); CTGTGGATCT (SEQ ID NO:335);
TGTGGATCTA (SEQ ID NO:336); GTGGATCTAC (SEQ ID NO:337);
TGGATCTACC (SEQ ID NO:338); GGATCTACCA (SEQ ID NO:339);
GATCTACCAC (SEQ ID NO:340); ATCTACCACA (SEQ ID NO:341);
TCTACCACAC (SEQ ID NO:342); CTACCACACA (SEQ ID NO:343);
TACCACACAC (SEQ ID NO:344); ACCACACACA (SEQ ID NO:345);
CCACACACAA (SEQ ID NO:346); CACACACAAG (SEQ ID NO:347);
ACACACAAGG (SEQ ID NO:348); CACACAAGGC (SEQ ID NO:349);
ACACAAGGCT (SEQ ID NO:350); CACAAGGCTA (SEQ ID NO:351);
ACAAGGCTAC (SEQ ID NO:352); CAAGGCTACT (SEQ ID NO:353);
AAGGCTACTT (SEQ ID NO:354); AGGCTACTTC (SEQ ID NO:355);
GGCTACTTCC (SEQ ID NO:356); GCTACTTCCC (SEQ ID NO:357);
CTACTTCCCT (SEQ ID NO:358); TACTTCCCTG (SEQ ID NO:359);
ACTTCCCTGA (SEQ ID NO:360); CTTCCCTGAT (SEQ ID NO:361);
TTCCCTGATT (SEQ ID NO:362); TCCCTGATTG (SEQ ID NO:363);
CCCTGATTGG (SEQ ID NO:364); CCTGATTGGC (SEQ ID NO:365);
CTGATTGGCA (SEQ ID NO:366); TGATTGGCAG (SEQ ID NO:367);
GATTGGCAGA (SEQ ID NO:368); ATTGGCAGAA (SEQ ID NO:369);
TTGGCAGAAC (SEQ ID NO:370); TGGCAGAACT (SEQ ID NO:371);
GGCAGAACTA (SEQ ID NO:372); GCAGAACTAC (SEQ ID NO:373);
CAGAACTACA (SEQ ID NO:374); AGAACTACAC (SEQ ID NO:375);
GAACTACACA (SEQ ID NO:376); AACTACACAC (SEQ ID NO:377);
ACTACACACC (SEQ ID NO:378); CTACACACCA (SEQ ID NO:379);
TACACACCAG (SEQ ID NO:380); ACACACCAGG (SEQ ID NO:381);
CACACCAGGG (SEQ ID NO:382); ACACCAGGGC (SEQ ID NO:383);
CACCAGGGCC (SEQ ID NO:384); ACCAGGGCCA (SEQ ID NO:385);
CCAGGGCCAG (SEQ ID NO:386); CAGGGCCAGG (SEQ ID NO:387);
AGGGCCAGGG (SEQ ID NO:388); GGGCCAGGGG (SEQ ID NO:389);
GGCCAGGGGT (SEQ ID NO:390); GCCAGGGGTC (SEQ ID NO:391);
CCAGGGGTCA (SEQ ID NO:392); CAGGGGTCAG (SEQ ID NO:393);
AGGGGTCAGA (SEQ ID NO:394); GGGGTCAGAT (SEQ ID NO:395);
GGGTCAGATA (SEQ ID NO:396); GGTCAGATAT (SEQ ID NO:397);
GTCAGATATC (SEQ ID NO:398); TCAGATATCC (SEQ ID NO:399);
CAGATATCCA (SEQ ID NO:400); AGATATCCAC (SEQ ID NO:401);
GATATCCACT (SEQ ID NO:402); ATATCCACTG (SEQ ID NO:403);
TATCCACTGA (SEQ ID NO:404); ATCCACTGAC (SEQ ID NO:405);
TCCACTGACC (SEQ ID NO:406); CCACTGACCT (SEQ ID NO:407);
CACTGACCTT (SEQ ID NO:408); ACTGACCTTT (SEQ ID NO:409);
CTGACCTTTG (SEQ ID NO:410); TGACCTTTGG (SEQ ID NO:411);
GACCTTTGGA (SEQ ID NO:412); ACCTTTGGAT (SEQ ID NO:413);
CCTTTGGATG (SEQ ID NO:414); CTTTGGATGG (SEQ ID NO:415);
TTTGGATGGT (SEQ ID NO:416); TTGGATGGTG (SEQ ID NO:417);
TGGATGGTGC (SEQ ID NO:418); GGATGGTGCT (SEQ ID NO:419);
GATGGTGCTA (SEQ ID NO:420); ATGGTGCTAC (SEQ ID NO:421);
TGGTGCTACA (SEQ ID NO:422); GGTGCTACAA (SEQ ID NO:423);
GTGCTACAAG (SEQ ID NO:424); TGCTACAAGC (SEQ ID NO:425);
GCTACAAGCT (SEQ ID NO:426); CTACAAGCTA (SEQ ID NO:427);
TACAAGCTAG (SEQ ID NO:428); ACAAGCTAGT (SEQ ID NO:429);
CAAGCTAGTA (SEQ ID NO:430); AAGCTAGTAC (SEQ ID NO:431);
AGCTAGTACC (SEQ ID NO:432); GCTAGTACCA (SEQ ID NO:433);
CTAGTACCAG (SEQ ID NO:434); TAGTACCAGT (SEQ ID NO:435);
AGTACCAGTT (SEQ ID NO:436); GTACCAGTTG (SEQ ID NO:437);
TACCAGTTGA (SEQ ID NO:438); ACCAGTTGAG (SEQ ID NO:439);
CCAGTTGAGC (SEQ ID NO:440); CAGTTGAGCC (SEQ ID NO:441);
AGTTGAGCCA (SEQ ID NO:442); GTTGAGCCAG (SEQ ID NO:443);
TTGAGCCAGA (SEQ ID NO:444); TGAGCCAGAT (SEQ ID NO:445);
GAGCCAGATA (SEQ ID NO:446); AGCCAGATAA (SEQ ID NO:447);
GCCAGATAAG (SEQ ID NO:448); CCAGATAAGG (SEQ ID NO:449);
CAGATAAGGT (SEQ ID NO:450); AGATAAGGTA (SEQ ID NO:451);
GATAAGGTAG (SEQ ID NO:452); ATAAGGTAGA (SEQ ID NO:453);
TAAGGTAGAA (SEQ ID NO:454); AAGGTAGAAG (SEQ ID NO:455);
AGGTAGAAGA (SEQ ID NO:456); GGTAGAAGAG (SEQ ID NO:457);
GTAGAAGAGG (SEQ ID NO:458); TAGAAGAGGC (SEQ ID NO:459);
AGAAGAGGCC (SEQ ID NO:460); GAAGAGGCCA (SEQ ID NO:461);
AAGAGGCCAA (SEQ ID NO:462); AGAGGCCAAT (SEQ ID NO:463);
GAGGCCAATA (SEQ ID NO:464); AGGCCAATAA (SEQ ID NO:465);
GGCCAATAAA (SEQ ID NO:466); GCCAATAAAG (SEQ ID NO:467);
CCAATAAAGG (SEQ ID NO:468); CAATAAAGGA (SEQ ID NO:469);
AATAAAGGAG (SEQ ID NO:470); ATAAAGGAGA (SEQ ID NO:471);
TAAAGGAGAG (SEQ ID NO:472); AAAGGAGAGA (SEQ ID NO:473);
AAGGAGAGAA (SEQ ID NO:474); AGGAGAGAAC (SEQ ID NO:475);
GGAGAGAACA (SEQ ID NO:476); GAGAGAACAC (SEQ ID NO:477);
AGAGAACACC (SEQ ID NO:478); GAGAACACCA (SEQ ID NO:479);
AGAACACCAG (SEQ ID NO:480); GAACACCAGC (SEQ ID NO:481);
AACACCAGCT (SEQ ID NO:482); ACACCAGCTT (SEQ ID NO:483);
CACCAGCTTG (SEQ ID NO:484); ACCAGCTTGT (SEQ ID NO:485);
CCAGCTTGTT (SEQ ID NO:486); CAGCTTGTTA (SEQ ID NO:487);
AGCTTGTTAC (SEQ ID NO:488); GCTTGTTACA (SEQ ID NO:489);
CTTGTTACAC (SEQ ID NO:490); TTGTTACACC (SEQ ID NO:491);
TGTTACACCC (SEQ ID NO:492); GTTACACCCT (SEQ ID NO:493);
TTACACCCTG (SEQ ID NO:494); TACACCCTGT (SEQ ID NO:495);
ACACCCTGTG (SEQ ID NO:496); CACCCTGTGA (SEQ ID NO:497);
ACCCTGTGAG (SEQ ID NO:498); CCCTGTGAGC (SEQ ID NO:499);
CCTGTGAGCC (SEQ ID NO:500); CTGTGAGCCT (SEQ ID NO:501);
TGTGAGCCTG (SEQ ID NO:502); GTGAGCCTGC (SEQ ID NO:503);
TGAGCCTGCA (SEQ ID NO:504); GAGCCTGCAT (SEQ ID NO:505);
AGCCTGCATG (SEQ ID NO:506); GCCTGCATGG (SEQ ID NO:507);
CCTGCATGGA (SEQ ID NO:508); CTGCATGGAA (SEQ ID NO:509);
TGCATGGAAT (SEQ ID NO:510); GCATGGAATG (SEQ ID NO:511);
CATGGAATGG (SEQ ID NO:512); ATGGAATGGA (SEQ ID NO:513);
TGGAATGGAT (SEQ ID NO:514); GGAATGGATG (SEQ ID NO:515);
GAATGGATGA (SEQ ID NO:516); AATGGATGAC (SEQ ID NO:517);
ATGGATGACC (SEQ ID NO:518); TGGATGACCC (SEQ ID NO:519);
GGATGACCCT (SEQ ID NO:520); GATGACCCTG (SEQ ID NO:521);
ATGACCCTGA (SEQ ID NO:522); TGACCCTGAG (SEQ ID NO:523);
GACCCTGAGA (SEQ ID NO:524); ACCCTGAGAG (SEQ ID NO:525);
CCCTGAGAGA (SEQ ID NO:526); CCTGAGAGAG (SEQ ID NO:527);
CTGAGAGAGA (SEQ ID NO:528); TGAGAGAGAA (SEQ ID NO:529);
GAGAGAGAAG (SEQ ID NO:530); AGAGAGAAGT (SEQ ID NO:531);
GAGAGAAGTG (SEQ ID NO:532); AGAGAAGTGT (SEQ ID NO:533);
GAGAAGTGTT (SEQ ID NO:534); AGAAGTGTTA (SEQ ID NO:535);
GAAGTGTTAG (SEQ ID NO:536); AAGTGTTAGA (SEQ ID NO:537);
AGTGTTAGAG (SEQ ID NO:538); GTGTTAGAGT (SEQ ID NO:539);
TGTTAGAGTG (SEQ ID NO:540); GTTAGAGTGG (SEQ ID NO:541);
TTAGAGTGGA (SEQ ID NO:542); TAGAGTGGAG (SEQ ID NO:543);
AGAGTGGAGG (SEQ ID NO:544); GAGTGGAGGT (SEQ ID NO:545);
AGTGGAGGTT (SEQ ID NO:546); GTGGAGGTTT (SEQ ID NO:547);
TGGAGGTTTG (SEQ ID NO:548); GGAGGTTTGA (SEQ ID NO:549);
GAGGTTTGAC (SEQ ID NO:550); AGGTTTGACA (SEQ ID NO:551);
GGTTTGACAG (SEQ ID NO:552); GTTTGACAGC (SEQ ID NO:553);
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TTTGACAGCC (SEQ ID N0tS54); 

TGACAOCCOC (SBQ ID N0:556) ; 

ACAOCCQCCT (SEQ ID NO:558); 

AQCCGCCTAG (SBQ ID MO:S60) ; 

CCOCCTAQCA, (SEQ ID NO:562); 

GCCTAGCATT (SEQ ID NO:5fi4) ; 

CTAGCATTTC (SEQ ID NO:566) ; 

AOCATTTCAT (SEQ ID NO: 566); 

CATTTCATCA (SEQ ID WO:570); 

TTTCATCACa (SEQ ID MO:572) ; 

TCATCACGTG (SEQ ID N0:574) ; 
ATCACOTOOC (SEQ ID MO t S7« ) | 
CACOTOGCCC (SEQ ID NO: 578); 
CGTOOCCCOA (SBQ ID NO:580) ; 
TGGCCCGAGA (SEQ ID NO: 582); 
OCCCGAOAOC (SBQ ID NO: 584); 
CCOAOAGCTG (SBQ ZD NO: 586); 
OAOAGCTQCA (SEQ ID NO: 588); 
OAaCTOCATC (SEQ ID NO: 590); 
GCTGCATCCG (SEQ ID NO: 592); 
TOCATCCOGA (SBQ ID NO: 594) ; 
CATCOGGAGrr (SEQ ID NO: 596); 
TCCGGAGTAC (SEQ ID NO: 598); 
OCkSAGTACTT (SBQ TD NO: 600); 
GAGTACTTCA (SEQ ID NO: 602) ; 
GTACTTCAAQ (SEQ ID NO: 604); 
ACTTCAAGAA (SEQ ID NO: 606); 
TTCAAGAACT (SEQ ID NO: 608) ; 
CAA6AACTGC (SBQ ID NO: 610) ; 
A6AACT6CTG (SBQ ID NO; 612) ; 
TTGACAGCCG (SEQ ID NO:555);
GACAGCCGCC (SEQ ID NO:557);
CAGCCGCCTA (SEQ ID NO:559);
GCCGCCTAGC (SEQ ID NO:561);
CGCCTAGCAT (SEQ ID NO:563);
CCTAGCATTT (SEQ ID NO:565);
TAGCATTTCA (SEQ ID NO:567);
GCATTTCATC (SEQ ID NO:569);
ATTTCATCAC (SEQ ID NO:571);
TTCATCACGT (SEQ ID NO:573);
CATCACGTGG (SEQ ID NO:575);
TCACGTGGCC (SEQ ID NO:577);
ACGTGGCCCG (SEQ ID NO:579);
GTGGCCCGAG (SEQ ID NO:581);
GGCCCGAGAG (SEQ ID NO:583);
CCCGAGAGCT (SEQ ID NO:585);
CGAGAGCTGC (SEQ ID NO:587);
AGAGCTGCAT (SEQ ID NO:589);
AGCTGCATCC (SEQ ID NO:591);
CTGCATCCGG (SEQ ID NO:593);
GCATCCGGAG (SEQ ID NO:595);
ATCCGGAGTA (SEQ ID NO:597);
CCGGAGTACT (SEQ ID NO:599);
GGAGTACTTC (SEQ ID NO:601);
AGTACTTCAA (SEQ ID NO:603);
TACTTCAAGA (SEQ ID NO:605);
CTTCAAGAAC (SEQ ID NO:607);
TCAAGAACTG (SEQ ID NO:609);
AAGAACTGCT (SEQ ID NO:611);
GAACTGCTGA (SEQ ID NO:613).
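
The decanucleotides recited in the preceding list appear to be successive overlapping ten-nucleotide windows of a single sequence, each advanced by one nucleotide and numbered consecutively. By way of illustration only, the following Python sketch enumerates such windows. The function name, the example fragment and the starting number 554 used to label it are assumptions taken from the tail of the list for the purpose of the example and form no part of the claims.

```python
def decanucleotide_windows(sequence, first_seq_id, width=10):
    """Return (SEQ ID number, window) pairs for every overlapping window.

    Illustrative helper only (hypothetical name): windows advance by one
    nucleotide and are numbered consecutively from first_seq_id.
    """
    sequence = sequence.upper()
    return [
        (first_seq_id + i, sequence[i:i + width])
        for i in range(len(sequence) - width + 1)
    ]


# Example fragment assumed from the end of the list above (SEQ ID NOs 554 to 556).
fragment = "TTTGACAGCCGC"
for seq_id, window in decanucleotide_windows(fragment, 554):
    print(f"{window} (SEQ ID NO:{seq_id});")
```

A twelve-nucleotide fragment yields three windows, so the example prints three entries in the same form as the list.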



44. A method according to claim 38 wherein said deletion encompasses at least one
of the following decanucleotide deletions from the nef gene:
ACCAGCTTGT [SEQ ID NO:803] CCAGCTTGTT [SEQ ID NO:804]
CAGCTTGTTA [SEQ ID NO:805] AGCTTGTTAC [SEQ ID NO:806]
GCTTGTTACA [SEQ ID NO:807] CTTGTTACAC [SEQ ID NO:808]
TTGTTACACC [SEQ ID NO:809] TGTTACACCC [SEQ ID NO:810]
GTTACACCCT [SEQ ID NO:811] TTACACCCTG [SEQ ID NO:812]
TACACCCTGT [SEQ ID NO:813] ACACCCTGTG [SEQ ID NO:814]
CACCCTGTGA [SEQ ID NO:815] ACCCTGTGAG [SEQ ID NO:816]
CCCTGTGAGC [SEQ ID NO:817] CCTGTGAGCC [SEQ ID NO:818]
CTGTGAGCCT [SEQ ID NO:819] TGTGAGCCTG [SEQ ID NO:820]
GTGAGCCTGC [SEQ ID NO:821] TGAGCCTGCA [SEQ ID NO:822]
GAGCCTGCAT [SEQ ID NO:823] AGCCTGCATG [SEQ ID NO:824]
GCCTGCATGG [SEQ ID NO:825] CCTGCATGGA [SEQ ID NO:826]
CTGCATGGAA [SEQ ID NO:827] TGCATGGAAT [SEQ ID NO:828]
GCATGGAATG [SEQ ID NO:829] CATGGAATGG [SEQ ID NO:830]
ATGGAATGGA [SEQ ID NO:831] TGGAATGGAT [SEQ ID NO:832]
GGAATGGATG [SEQ ID NO:833] GAATGGATGA [SEQ ID NO:834]
AATGGATGAC [SEQ ID NO:835] ATGGATGACC [SEQ ID NO:836]
TGGATGACCC [SEQ ID NO:837] GGATGACCCT [SEQ ID NO:838]
GATGACCCTG [SEQ ID NO:839] ATGACCCTGA [SEQ ID NO:840]
TGACCCTGAG [SEQ ID NO:841]



45. A method according to claim 38 wherein said deletion encompasses at least one
of the following decanucleotides from the LTR region of HIV-1 NL4-3:



GCTTTTTGCC (SEQ ID NO:652);
TTTTTGCCTG (SEQ ID NO:654);
TTTGCCTGTA (SEQ ID NO:656);
TGCCTGTACT (SEQ ID NO:658);
CCTGTACTGG (SEQ ID NO:660);
TGTACTGGGT (SEQ ID NO:662);
TACTGGGTCT (SEQ ID NO:664);
CTGGGTCTCT (SEQ ID NO:666);
GGGTCTCTCT (SEQ ID NO:668);
GTCTCTCTGG (SEQ ID NO:670);
CTCTCTGGTT (SEQ ID NO:672);
CTCTGGTTAG (SEQ ID NO:674);
CTGGTTAGAC (SEQ ID NO:676);
GGTTAGACCA (SEQ ID NO:678);
TTAGACCAGA (SEQ ID NO:680);
AGACCAGATC (SEQ ID NO:682);
ACCAGATCTG (SEQ ID NO:684);
CAGATCTGAG (SEQ ID NO:686);



CTTTTTGCCT (SEQ ID NO:653);
TTTTGCCTGT (SEQ ID NO:655);
TTGCCTGTAC (SEQ ID NO:657);
GCCTGTACTG (SEQ ID NO:659);
CTGTACTGGG (SEQ ID NO:661);
GTACTGGGTC (SEQ ID NO:663);
ACTGGGTCTC (SEQ ID NO:665);
TGGGTCTCTC (SEQ ID NO:667);
GGTCTCTCTG (SEQ ID NO:669);
TCTCTCTGGT (SEQ ID NO:671);
TCTCTGGTTA (SEQ ID NO:673);
TGTCTGGTTA (SEQ ID NO:675);
TGGTTAGACC (SEQ ID NO:677);
GTTAGACCAG (SEQ ID NO:679);
TAGACCAGAT (SEQ ID NO:681);
GACCAGATCT (SEQ ID NO:683);
CCAGATCTGA (SEQ ID NO:685);
AGATCTGAGC (SEQ ID NO:687);
GATCTGAGCC (SEQ ID NO:688);
ATCTGAGCCT (SEQ ID NO:689);
TCTGAGCCTG (SEQ ID NO:690);
CTGAGCCTGG (SEQ ID NO:691);
TGAGCCTGGG (SEQ ID NO:692);
GAGCCTGGGA (SEQ ID NO:693);
AGCCTGGGAG (SEQ ID NO:694);
GCCTGGGAGC (SEQ ID NO:695);
CCTGGGAGCT (SEQ ID NO:696);
CTGGGAGCTC (SEQ ID NO:697);
TGGGAGCTCT (SEQ ID NO:698);
GGGAGCTCTC (SEQ ID NO:699);
GGAGCTCTCT (SEQ ID NO:700);
GAGCTCTCTG (SEQ ID NO:701);
AGCTCTCTGG (SEQ ID NO:702);
GCTCTCTGGC (SEQ ID NO:703);
CTCTCTGGCT (SEQ ID NO:704);
TCTCTGGCTA (SEQ ID NO:705);
CTCTGGCTAA (SEQ ID NO:706);
TCTGGCTAAC (SEQ ID NO:707);
CTGGCTAACT (SEQ ID NO:708);
TGGCTAACTA (SEQ ID NO:709);
GGCTAACTAG (SEQ ID NO:710);
GCTAACTAGG (SEQ ID NO:711);
CTAACTAGGG (SEQ ID NO:712);
TAACTAGGGA (SEQ ID NO:713);
AACTAGGGAA (SEQ ID NO:714);
ACTAGGGAAC (SEQ ID NO:715);
CTAGGGAACC (SEQ ID NO:716);
TAGGGAACCC (SEQ ID NO:717);
AGGGAACCCA (SEQ ID NO:718);
GGGAACCCAC (SEQ ID NO:719);
GGAACCCACT (SEQ ID NO:720);
GAACCCACTG (SEQ ID NO:721);
AACCCACTGC (SEQ ID NO:722);
ACCCACTGCT (SEQ ID NO:723);
CCCACTGCTT (SEQ ID NO:724);
CCACTGCTTA (SEQ ID NO:725);
CACTGCTTAA (SEQ ID NO:726);
ACTGCTTAAG (SEQ ID NO:727);
CTGCTTAAGC (SEQ ID NO:728);
TGCTTAAGCC (SEQ ID NO:729);
GCTTAAGCCT (SEQ ID NO:730);
CTTAAGCCTC (SEQ ID NO:731);
TTAAGCCTCA (SEQ ID NO:732);
TAAGCCTCAA (SEQ ID NO:733);
AAGCCTCAAT (SEQ ID NO:734);
AGCCTCAATA (SEQ ID NO:735);
GCCTCAATAA (SEQ ID NO:736);
CCTCAATAAA (SEQ ID NO:737);
CTCAATAAAG (SEQ ID NO:738);
TCAATAAAGC (SEQ ID NO:739);
CAATAAAGCT (SEQ ID NO:740);
AATAAAGCTT (SEQ ID NO:741);
ATAAAGCTTG (SEQ ID NO:742);
TAAAGCTTGC (SEQ ID NO:743);
AAAGCTTGCC (SEQ ID NO:744);
AAGCTTGCCT (SEQ ID NO:745);
AGCTTGCCTT (SEQ ID NO:746);
GCTTGCCTTG (SEQ ID NO:747);
CTTGCCTTGA (SEQ ID NO:748);
TTGCCTTGAG (SEQ ID NO:749);
TGCCTTGAGT (SEQ ID NO:750);
GCCTTGAGTG (SEQ ID NO:751);
CCTTGAGTGC (SEQ ID NO:752);
CTTGAGTGCT (SEQ ID NO:753);
TTGAGTGCTT (SEQ ID NO:754);
TGAGTGCTTC (SEQ ID NO:755);
GAGTGCTTCA (SEQ ID NO:756);
AGTGCTTCAA (SEQ ID NO:757);
GTGCTTCAAG (SEQ ID NO:758);
TGCTTCAAGT (SEQ ID NO:759);
GCTTCAAGTA (SEQ ID NO:760);
CTTCAAGTAG (SEQ ID NO:761);
TTCAAGTAGT (SEQ ID NO:762);
TCAAGTAGTG (SEQ ID NO:763);
CAAGTAGTGT (SEQ ID NO:764);
AAGTAGTGTG (SEQ ID NO:765);
AGTAGTGTGT (SEQ ID NO:766);
GTAGTGTGTG (SEQ ID NO:767);
TAGTGTGTGC (SEQ ID NO:768);
AGTGTGTGCC (SEQ ID NO:769);
GTGTGTGCCC (SEQ ID NO:770);
TGTGTGCCCG (SEQ ID NO:771);
GTGTGCCCGT (SEQ ID NO:772);
TGTGCCCGTC (SEQ ID NO:773);
GTGCCCGTCT (SEQ ID NO:774);
TGCCCGTCTG (SEQ ID NO:775);
GCCCGTCTGT (SEQ ID NO:776);
CCCGTCTGTT (SEQ ID NO:777);
CCGTCTGTTG (SEQ ID NO:778);
CGTCTGTTGT (SEQ ID NO:779);
GTCTGTTGTG (SEQ ID NO:780);
TCTGTTGTGT (SEQ ID NO:781);
CTGTTGTGTG (SEQ ID NO:782);
TGTTGTGTGA (SEQ ID NO:783);
GTTGTGTGAC (SEQ ID NO:784);
TTGTGTGACT (SEQ ID NO:785);
TGTGTGACTC (SEQ ID NO:786);
GTGTGACTCT (SEQ ID NO:787);
TGTGTGACTC (SEQ ID NO:788);
GTGTGACTCT (SEQ ID NO:789);
TGTGACTCTG (SEQ ID NO:790);
GTGACTCTGG (SEQ ID NO:791);
TGACTCTGGT (SEQ ID NO:792);
GACTCTGGTA (SEQ ID NO:793);
ACTCTGGTAA (SEQ ID NO:794);
CTCTGGTAAC (SEQ ID NO:795);
TCTGGTAACT (SEQ ID NO:796);
CTGGTAACTA (SEQ ID NO:797);
TGGTAACTAG (SEQ ID NO:798);
GGTAACTAGA (SEQ ID NO:799).



46. A method according to claim 30 wherein said HIV-1 isolate has the identifying
characteristics of HIV-1 isolate ECACC Accession No. V94101706 or HIV-1 isolate
ECACC Accession No. V941031169 or HIV-1 isolate ECACC Accession No.
V95031022.

47. A method according to claim 30 wherein the HIV-1 isolate is reactive to
antibodies to a glycoprotein of HIV-1, is capable of inducing an immune response to at least one of
gag, pol and/or env and which is incapable of directing synthesis of a nef gene product
or a full length nef gene product.

48. A method according to claim 47 wherein the glycoprotein is at least one of gp41-45,
gp120 and/or gp160.

49. A method for vaccinating an individual against the development of AIDS or
AIDS related diseases, said method comprising administering to said individual a
non-pathogenic isolate of HIV-1 in an amount effective to infect target cells and to generate
target cells carrying DNA derived from said non-pathogenic HIV-1.

50. A method according to claim 49 wherein said isolate is capable of stimulating
an immune response in a human or primate against at least one glycoprotein on HIV-1
whilst not substantially reducing proliferative responses and cytokine production to a
mitogen in said human or primate compared to a healthy, non-infected
subject.

51. A method according to claim 50 wherein said isolate carries one or more
mutations in its genome resulting in the inability to direct synthesis of at least one
polypeptide or protein from a pathogenic strain of HIV-1.

52. A method according to claim 49 wherein said isolate carries one or more
mutations in its genome resulting in said genome directing synthesis of a truncated form
of a polypeptide or protein from a pathogenic strain of HIV-1.

53. A method according to claim 51 or 52 wherein said HIV-1 isolate carries a
mutation in its nef gene and/or long terminal repeat (LTR) region or in a functionally
equivalent location in the HIV-1 genome.

54. A method according to claim 53 wherein said HIV-1 isolate is reactive to
antibodies to gp41-45, gp120 and/or gp160 of HIV-1 and carries a deletion of at least
ten contiguous nucleotides in a region corresponding to the nef gene and/or LTR region
of pathogenic HIV-1.

55. A method according to claim 53 wherein said HIV-1 isolate encodes a modified
Nef protein without amino acids 162 to 177 of wild-type HIV-1 NL4-3 Nef.

56. A method according to claim 54 or 55 wherein said HIV-1 isolate is capable of
inducing an immune response to at least one of gag, pol and/or env.
57. A method according to claim 56 wherein said HIV-1 isolate carries a deletion in
its genome of at least 10 nucleotides within the region from nucleotide 8787 to
nucleotide 9709 using the nucleotide numbering of HIV-1 NL4-3.
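
By way of illustration only, the criterion recited in the preceding claim, a deletion of at least 10 nucleotides lying within nucleotides 8787 to 9709, can be sketched in Python as follows. The function name, the inclusive treatment of the end points and the requirement that the whole deletion fall inside the region are assumptions made for the sketch and are not taken from the specification.

```python
def deletion_within_region(start, end, region=(8787, 9709), min_length=10):
    """Return True if a deletion spanning start..end is long enough and
    lies wholly inside region.

    Assumptions of this sketch: coordinates are inclusive nucleotide
    numbers and the entire deletion must fall inside the region.
    """
    if end < start:
        raise ValueError("end must not be smaller than start")
    length = end - start + 1
    region_start, region_end = region
    return length >= min_length and region_start <= start and end <= region_end


print(deletion_within_region(9281, 9371))  # 91 nucleotides, inside 8787-9709
print(deletion_within_region(8780, 8800))  # begins before nucleotide 8787
```

Passing a different `region` tuple allows the same check to be applied to any narrower nucleotide range.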

58. A method according to claim 57 wherein said isolate carries a deletion in its
genome of at least 10 nucleotides within a region selected from the list consisting of:
nucleotide (i) 8830-8862;

(ii) 9009-9035;
(iii) 9019-9029; and

(iv) 9033-9049.

59. A method according to claim 57 wherein said isolate carries a deletion in its
genome of at least 10 nucleotides within a region selected from the list consisting of:
nucleotide (v) 9281-9371;

(vi) 9281-93«2;

(vii) 9105-9224; and

(viii) 9271-9370.



60. A method according to claim 57 wherein said isolate carries a deletion in its
genome of at least 10 nucleotides within a region selected from the list consisting of:
nucleotide (ix) 8882-8928;

(x) 8850-9006;

(xi) 8792-9041; and
(xii) 9112-9204.

61. A method according to claim 57 wherein said isolate carries a deletion in its
genome of at least 10 nucleotides within a region selected from the list consisting of:
nucleotide (xiii) 9105-9224;

(xiv) 9389-9395; and

(xv) 9281-9366.





62. A method according to claim 57 wherein said deletion encompasses at least one
of the following decanucleotides from the nef gene of HIV-1 NL4-3:



ATGGGTGGCA (SEQ ID NO:2);
TGGGTGGCAA (SEQ ID NO:3);
GGGTGGCAAG (SEQ ID NO:4);
GGTGGCAAGT (SEQ ID NO:5);
GTGGCAAGTG (SEQ ID NO:6);
TGGCAAGTGG (SEQ ID NO:7);
GGCAAGTGGT (SEQ ID NO:8);
GCAAGTGGTC (SEQ ID NO:9);
CAAGTGGTCA (SEQ ID NO:10);
AAGTGGTCAA (SEQ ID NO:11);
AGTGGTCAAA (SEQ ID NO:12);
GTGGTCAAAA (SEQ ID NO:13);
TGGTCAAAAA (SEQ ID NO:14);
GGTCAAAAAG (SEQ ID NO:15);
GTCAAAAAGT (SEQ ID NO:16);
TCAAAAAGTA (SEQ ID NO:17);
CAAAAAGTAG (SEQ ID NO:18);
AAAAAGTAGT (SEQ ID NO:19);
AAAAGTAGTG (SEQ ID NO:20);
AAAGTAGTGT (SEQ ID NO:21);
AAGTAGTGTG (SEQ ID NO:22);
AGTAGTGTGA (SEQ ID NO:23);
GTAGTGTGAT (SEQ ID NO:24);
TAGTGTGATT (SEQ ID NO:25);
AGTGTGATTG (SEQ ID NO:26);
GTGTGATTGG (SEQ ID NO:27);
TGTGATTGGA (SEQ ID NO:28);
GTGATTGGAT (SEQ ID NO:29);
TGATTGGATG (SEQ ID NO:30);
GATTGGATGG (SEQ ID NO:31);
ATTGGATGGC (SEQ ID NO:32);
TTGGATGGCC (SEQ ID NO:33);
TGGATGGCCT (SEQ ID NO:34);
GGATGGCCTG (SEQ ID NO:35);
GATGGCCTGC (SEQ ID NO:36);
ATGGCCTGCT (SEQ ID NO:37);
TGGCCTGCTG (SEQ ID NO:38);
GGCCTGCTGT (SEQ ID NO:39);
GCCTGCTGTA (SEQ ID NO:40);
CCTGCTGTAA (SEQ ID NO:41);
CTGCTGTAAG (SEQ ID NO:42);
TGCTGTAAGG (SEQ ID NO:43);
GCTGTAAGGG (SEQ ID NO:44);
CTGTAAGGGA (SEQ ID NO:45);
TGTAAGGGAA (SEQ ID NO:46);
GTAAGGGAAA (SEQ ID NO:47);
TAAGGGAAAG (SEQ ID NO:48);
AAGGGAAAGA (SEQ ID NO:49);
AGGGAAAGAA (SEQ ID NO:50);
GGGAAAGAAT (SEQ ID NO:51);
GGAAAGAATG (SEQ ID NO:52);
GAAAGAATGA (SEQ ID NO:53);
AAAGAATGAG (SEQ ID NO:54);
AAGAATGAGA (SEQ ID NO:55);
AGAATGAGAC (SEQ ID NO:56);
GAATGAGACG (SEQ ID NO:57);
AATGAGACGA (SEQ ID NO:58);
ATGAGACGAG (SEQ ID NO:59);
TGAGACGAGC (SEQ ID NO:60);
GAGACGAGCT (SEQ ID NO:61);
AGACGAGCTG (SEQ ID NO:62);
GACGAGCTGA (SEQ ID NO:63);
ACGAGCTGAG (SEQ ID NO:64);
CGAGCTGAGC (SEQ ID NO:65);
GAGCTGAGCC (SEQ ID NO:66);
AGCTGAGCCA (SEQ ID NO:67);
GCTGAGCCAG (SEQ ID NO:68);
CTGAGCCAGC (SEQ ID NO:69);
TGAGCCAGCA (SEQ ID NO:70);
GAGCCAGCAG (SEQ ID NO:71);
AGCCAGCAGC (SEQ ID NO:72);
CCAGCAGCAG (SEQ ID NO:74);
AGCAGCAGAT (SEQ ID NO:76);
CAGCAGATGG (SEQ ID NO:78);
GCAGATGGGG (SEQ ID NO:80);
AGATGGGGTG (SEQ ID NO:82);
ATGGGGTGGG (SEQ ID NO:84);
GGGGTGGGAG (SEQ ID NO:86);
GGTGGGAGCA (SEQ ID NO:88);
TGGGAGCAGT (SEQ ID NO:90);
GGAGCAGTAT (SEQ ID NO:92);
AGCAGTATCT (SEQ ID NO:94);
CAGTATCTCG (SEQ ID NO:96);
GTATCTCGAG (SEQ ID NO:98);
ATCTCGAGAC (SEQ ID NO:100);
CTCGAGACCT (SEQ ID NO:102);
CGAGACCTAG (SEQ ID NO:104);
AGACCTAGAA (SEQ ID NO:106);
ACCTAGAAAA (SEQ ID NO:108);
CTAGAAAAAC (SEQ ID NO:110);
AGAAAAACAT (SEQ ID NO:112);
AAAAACATGG (SEQ ID NO:114);
AAACATGGAG (SEQ ID NO:116);
ACATGGAGCA (SEQ ID NO:118);
ATGGAGCAAT (SEQ ID NO:120);
GGAGCAATCA (SEQ ID NO:122);
AGCAATCACA (SEQ ID NO:124);
CAATCACAAG (SEQ ID NO:126);
ATCACAAGTA (SEQ ID NO:128);
CACAAGTAGC (SEQ ID NO:130);
CAAGTAGCAA (SEQ ID NO:132);
AGTAGCAATA (SEQ ID NO:134);
TAGCAATACA (SEQ ID NO:136);
GCAATACAGC (SEQ ID NO:138);
AATACAGCAG (SEQ ID NO:140);
TACAGCAGCT (SEQ ID NO:142);
CAGCAGCTAA (SEQ ID NO:144);
GCAGCTAACA (SEQ ID NO:146);
GCCAGCAGCA (SEQ ID NO:73);
CAGCAGCAGA (SEQ ID NO:75);
GCAGCAGATG (SEQ ID NO:77);
AGCAGATGGG (SEQ ID NO:79);
CAGATGGGGT (SEQ ID NO:81);
GATGGGGTGG (SEQ ID NO:83);
TGGGGTGGGA (SEQ ID NO:85);
GGGTGGGAGC (SEQ ID NO:87);
GTGGGAGCAG (SEQ ID NO:89);
GGGAGCAGTA (SEQ ID NO:91);
GAGCAGTATC (SEQ ID NO:93);
GCAGTATCTC (SEQ ID NO:95);
AGTATCTCGA (SEQ ID NO:97);
TATCTCGAGA (SEQ ID NO:99);
TCTCGAGACC (SEQ ID NO:101);
TCGAGACCTA (SEQ ID NO:103);
GAGACCTAGA (SEQ ID NO:105);
GACCTAGAAA (SEQ ID NO:107);
CCTAGAAAAA (SEQ ID NO:109);
TAGAAAAACA (SEQ ID NO:111);
GAAAAACATG (SEQ ID NO:113);
AAAACATGGA (SEQ ID NO:115);
AACATGGAGC (SEQ ID NO:117);
CATGGAGCAA (SEQ ID NO:119);
TGGAGCAATC (SEQ ID NO:121);
GAGCAATCAC (SEQ ID NO:123);
GCAATCACAA (SEQ ID NO:125);
AATCACAAGT (SEQ ID NO:127);
TCACAAGTAG (SEQ ID NO:129);
ACAAGTAGCA (SEQ ID NO:131);
AAGTAGCAAT (SEQ ID NO:133);
GTAGCAATAC (SEQ ID NO:135);
AGCAATACAG (SEQ ID NO:137);
CAATACAGCA (SEQ ID NO:139);
ATACAGCAGC (SEQ ID NO:141);
ACAGCAGCTA (SEQ ID NO:143);
AGCAGCTAAC (SEQ ID NO:145);
CAGCTAACAA (SEQ ID NO:147);
AGCTAACAAT (SEQ ID NO:148);
CTAACAATGC (SEQ ID NO:150);
AACAATGCTG (SEQ ID NO:152);
CAATGCTGCT (SEQ ID NO:154);
ATGCTGCTTG (SEQ ID NO:156);
GCTGCTTGTG (SEQ ID NO:158);
TGCTTGTGCC (SEQ ID NO:160);
CTTGTGCCTG (SEQ ID NO:162);
TGTGCCTGGC (SEQ ID NO:164);
TGCCTGGCTA (SEQ ID NO:166);
CCTGGCTAGA (SEQ ID NO:168);
TGGCTAGAAG (SEQ ID NO:170);
GCTAGAAGCA (SEQ ID NO:172);
TAGAAGCACA (SEQ ID NO:174);
GAAGCACAAG (SEQ ID NO:176);
AGCACAAGAG (SEQ ID NO:178);
CACAAGAGGA (SEQ ID NO:180);
CAAGAGGAGG (SEQ ID NO:182);
AGAGGAGGAA (SEQ ID NO:184);
AGGAGGAAGA (SEQ ID NO:186);
GAGGAAGAGG (SEQ ID NO:188);
GGAAGAGGTG (SEQ ID NO:190);
AAGAGGTGGG (SEQ ID NO:192);
GAGGTGGGTT (SEQ ID NO:194);
GGTGGGTTTT (SEQ ID NO:196);
TGGGTTTTCC (SEQ ID NO:198);
GGTTTTCCAG (SEQ ID NO:200);
TTTTCCAGTC (SEQ ID NO:202);
TTCCAGTCAC (SEQ ID NO:204);
CCAGTCACAC (SEQ ID NO:206);
AGTCACACCT (SEQ ID NO:208);
TCACACCTCA (SEQ ID NO:210);
ACACCTCAGG (SEQ ID NO:212);
ACCTCAGGTA (SEQ ID NO:214);
CTCAGGTACC (SEQ ID NO:216);
CAGGTACCTT (SEQ ID NO:218);
GGTACCTTTA (SEQ ID NO:220);
TACCTTTAAG (SEQ ID NO:222);
GCTAACAATG (SEQ ID NO:149);
TAACAATGCT (SEQ ID NO:151);
ACAATGCTGC (SEQ ID NO:153);
AATGCTGCTT (SEQ ID NO:155);
TGCTGCTTGT (SEQ ID NO:157);
CTGCTTGTGC (SEQ ID NO:159);
GCTTGTGCCT (SEQ ID NO:161);
TTGTGCCTGG (SEQ ID NO:163);
GTGCCTGGCT (SEQ ID NO:165);
GCCTGGCTAG (SEQ ID NO:167);
CTGGCTAGAA (SEQ ID NO:169);
GGCTAGAAGC (SEQ ID NO:171);
CTAGAAGCAC (SEQ ID NO:173);
AGAAGCACAA (SEQ ID NO:175);
AAGCACAAGA (SEQ ID NO:177);
GCACAAGAGG (SEQ ID NO:179);
ACAAGAGGAG (SEQ ID NO:181);
AAGAGGAGGA (SEQ ID NO:183);
GAGGAGGAAG (SEQ ID NO:185);
GGAGGAAGAG (SEQ ID NO:187);
AGGAAGAGGT (SEQ ID NO:189);
GAAGAGGTGG (SEQ ID NO:191);
AGAGGTGGGT (SEQ ID NO:193);
AGGTGGGTTT (SEQ ID NO:195);
GTGGGTTTTC (SEQ ID NO:197);
GGGTTTTCCA (SEQ ID NO:199);
GTTTTCCAGT (SEQ ID NO:201);
TTTCCAGTCA (SEQ ID NO:203);
TCCAGTCACA (SEQ ID NO:205);
CAGTCACACC (SEQ ID NO:207);
GTCACACCTC (SEQ ID NO:209);
CACACCTCAG (SEQ ID NO:211);
CACCTCAGGT (SEQ ID NO:213);
CCTCAGGTAC (SEQ ID NO:215);
TCAGGTACCT (SEQ ID NO:217);
AGGTACCTTT (SEQ ID NO:219);
GTACCTTTAA (SEQ ID NO:221);
ACCTTTAAGA (SEQ ID NO:223);
CCTTTAAGAC (SEQ ID NO:224);
TTTAAGACCA (SEQ ID NO:226);
TAAGACCAAT (SEQ ID NO:228);
AGACCAATGA (SEQ ID NO:230);
ACCAATGACT (SEQ ID NO:232);
CAATGACTTA (SEQ ID NO:234);
ATGACTTACA (SEQ ID NO:236);
GACTTACAAG (SEQ ID NO:238);
CTTACAAGGC (SEQ ID NO:240);
TACAAGGCAG (SEQ ID NO:242);
CAAGGCAGCT (SEQ ID NO:244);
AGGCAGCTGT (SEQ ID NO:246);
GCAGCTGTAG (SEQ ID NO:248);
AGCTGTAGAT (SEQ ID NO:250);
CTGTAGATCT (SEQ ID NO:252);
GTAGATCTTA (SEQ ID NO:254);
AGATCTTAGC (SEQ ID NO:256);
ATCTTAGCCA (SEQ ID NO:258);
CTTAGCCACT (SEQ ID NO:260);
TAGCCACTTT (SEQ ID NO:262);
GCCACTTTTT (SEQ ID NO:264);
CACTTTTTAA (SEQ ID NO:266);
CTTTTTAAAA (SEQ ID NO:268);
TTTTAAAAGA (SEQ ID NO:270);
TTAAAAGAAA (SEQ ID NO:272);
AAAAGAAAAG (SEQ ID NO:274);
AAGAAAAGGG (SEQ ID NO:276);
GAAAAGGGGG (SEQ ID NO:278);
AAAGGGGGGA (SEQ ID NO:280);
AGGGGGGACT (SEQ ID NO:282);
GGGGGACTGG (SEQ ID NO:284);
GGGACTGGAA (SEQ ID NO:286);
GACTGGAAGG (SEQ ID NO:288);
CTGGAAGGGC (SEQ ID NO:290);
GGAAGGGCTA (SEQ ID NO:292);
AAGGGCTAAT (SEQ ID NO:294);
GGGCTAATTC (SEQ ID NO:296);
GCTAATTCAC (SEQ ID NO:298);
CTTTAAGACC (SEQ ID NO:225);
TTAAGACCAA (SEQ ID NO:227);
AAGACCAATG (SEQ ID NO:229);
GACCAATGAC (SEQ ID NO:231);
CCAATGACTT (SEQ ID NO:233);
AATGACTTAC (SEQ ID NO:235);
TGACTTACAA (SEQ ID NO:237);
ACTTACAAGG (SEQ ID NO:239);
TTACAAGGCA (SEQ ID NO:241);
ACAAGGCAGC (SEQ ID NO:243);
AAGGCAGCTG (SEQ ID NO:245);
GGCAGCTGTA (SEQ ID NO:247);
CAGCTGTAGA (SEQ ID NO:249);
GCTGTAGATC (SEQ ID NO:251);
TGTAGATCTT (SEQ ID NO:253);
TAGATCTTAG (SEQ ID NO:255);
GATCTTAGCC (SEQ ID NO:257);
TCTTAGCCAC (SEQ ID NO:259);
TTAGCCACTT (SEQ ID NO:261);
AGCCACTTTT (SEQ ID NO:263);
CCACTTTTTA (SEQ ID NO:265);
ACTTTTTAAA (SEQ ID NO:267);
TTTTTAAAAG (SEQ ID NO:269);
TTTAAAAGAA (SEQ ID NO:271);
TAAAAGAAAA (SEQ ID NO:273);
AAAGAAAAGG (SEQ ID NO:275);
AGAAAAGGGG (SEQ ID NO:277);
AAAAGGGGGG (SEQ ID NO:279);
AAGGGGGGAC (SEQ ID NO:281);
GGGGGGACTG (SEQ ID NO:283);
GGGGACTGGA (SEQ ID NO:285);
GGACTGGAAG (SEQ ID NO:287);
ACTGGAAGGG (SEQ ID NO:289);
TGGAAGGGCT (SEQ ID NO:291);
GAAGGGCTAA (SEQ ID NO:293);
AGGGCTAATT (SEQ ID NO:295);
GGCTAATTCA (SEQ ID NO:297);
CTAATTCACT (SEQ ID NO:299);
TAATTCACTC (SEQ ID NO:300);
ATTCACTCCC (SEQ ID NO:302);
TCACTCCCAA (SEQ ID NO:304);
ACTCCCAAAG (SEQ ID NO:306);
TCCCAAAGAA (SEQ ID NO:308);
CCAAAGAAGA (SEQ ID NO:310);
AAAGAAGACA (SEQ ID NO:312);
AGAAGACAAG (SEQ ID NO:314);
AAGACAAGAT (SEQ ID NO:316);
GACAAGATAT (SEQ ID NO:318);
CAAGATATCC (SEQ ID NO:320);
AGATATCCTT (SEQ ID NO:322);
ATATCCTTGA (SEQ ID NO:324);
ATCCTTGATC (SEQ ID NO:326);
CCTTGATCTG (SEQ ID NO:328);
TTGATCTGTG (SEQ ID NO:330);
GATCTGTGGA (SEQ ID NO:332);
TCTGTGGATC (SEQ ID NO:334);
TGTGGATCTA (SEQ ID NO:336);
TGGATCTACC (SEQ ID NO:338);
GATCTACCAC (SEQ ID NO:340);
TCTACCACAC (SEQ ID NO:342);
TACCACACAC (SEQ ID NO:344);
CCACACACAA (SEQ ID NO:346);
ACACACAAGG (SEQ ID NO:348);
ACACAAGGCT (SEQ ID NO:350);
ACAAGGCTAC (SEQ ID NO:352);
AAGGCTACTT (SEQ ID NO:354);
GGCTACTTCC (SEQ ID NO:356);
CTACTTCCCT (SEQ ID NO:358);
ACTTCCCTGA (SEQ ID NO:360);
TTCCCTGATT (SEQ ID NO:362);
CCCTGATTGG (SEQ ID NO:364);
CTGATTGGCA (SEQ ID NO:366);
GATTGGCAGA (SEQ ID NO:368);
TTGGCAGAAC (SEQ ID NO:370);
GGCAGAACTA (SEQ ID NO:372);
CAGAACTACA (SEQ ID NO:374);



AATTCACTCC (SEQ ID NO:301);
TTCACTCCCA (SEQ ID NO:303);
CACTCCCAAA (SEQ ID NO:305);
CTCCCAAAGA (SEQ ID NO:307);
CCCAAAGAAG (SEQ ID NO:309);
CAAAGAAGAC (SEQ ID NO:311);
AAGAAGACAA (SEQ ID NO:313);
GAAGACAAGA (SEQ ID NO:315);
AGACAAGATA (SEQ ID NO:317);
ACAAGATATC (SEQ ID NO:319);
AAGATATCCT (SEQ ID NO:321);
GATATCCTTG (SEQ ID NO:323);
TATCCTTGAT (SEQ ID NO:325);
TCCTTGATCT (SEQ ID NO:327);
CTTGATCTGT (SEQ ID NO:329);
TGATCTGTGG (SEQ ID NO:331);
ATCTGTGGAT (SEQ ID NO:333);
CTGTGGATCT (SEQ ID NO:335);
GTGGATCTAC (SEQ ID NO:337);
GGATCTACCA (SEQ ID NO:339);
ATCTACCACA (SEQ ID NO:341);
CTACCACACA (SEQ ID NO:343);
ACCACACACA (SEQ ID NO:345);
CACACACAAG (SEQ ID NO:347);
CACACAAGGC (SEQ ID NO:349);
CACAAGGCTA (SEQ ID NO:351);
CAAGGCTACT (SEQ ID NO:353);
AGGCTACTTC (SEQ ID NO:355);
GCTACTTCCC (SEQ ID NO:357);
TACTTCCCTG (SEQ ID NO:359);
CTTCCCTGAT (SEQ ID NO:361);
TCCCTGATTG (SEQ ID NO:363);
CCTGATTGGC (SEQ ID NO:365);
TGATTGGCAG (SEQ ID NO:367);
ATTGGCAGAA (SEQ ID NO:369);
TGGCAGAACT (SEQ ID NO:371);
GCAGAACTAC (SEQ ID NO:373);
AGAACTACAC (SEQ ID NO:375);
GAACTACACA (SEQ ID NO:376);
ACTACACACC (SEQ ID NO:378);
TACACACCAG (SEQ ID NO:380);
CACACCAGGG (SEQ ID NO:382);
CACCAGGGCC (SEQ ID NO:384);
CCAGGGCCAG (SEQ ID NO:386);
AGGGCCAGGG (SEQ ID NO:388);
GGCCAGGGGT (SEQ ID NO:390);
CCAGGGGTCA (SEQ ID NO:392);
AGGGGTCAGA (SEQ ID NO:394);
GGGTCAGATA (SEQ ID NO:396);
GTCAGATATC (SEQ ID NO:398);
CAGATATCCA (SEQ ID NO:400);
GATATCCACT (SEQ ID NO:402);
TATCCACTGA (SEQ ID NO:404);
TCCACTGACC (SEQ ID NO:406);
CACTGACCTT (SEQ ID NO:408);
CTGACCTTTG (SEQ ID NO:410);
GACCTTTGGA (SEQ ID NO:412);
CCTTTGGATG (SEQ ID NO:414);
TTTGGATGGT (SEQ ID NO:416);
TGGATGGTGC (SEQ ID NO:418);
GATGGTGCTA (SEQ ID NO:420);
TGGTGCTACA (SEQ ID NO:422);
GTGCTACAAG (SEQ ID NO:424);
GCTACAAGCT (SEQ ID NO:426);
TACAAGCTAG (SEQ ID NO:428);
CAAGCTAGTA (SEQ ID NO:430);
AGCTAGTACC (SEQ ID NO:432);
CTAGTACCAG (SEQ ID NO:434);
AGTACCAGTT (SEQ ID NO:436);
TACCAGTTGA (SEQ ID NO:438);
CCAGTTGAGC (SEQ ID NO:440);
AGTTGAGCCA (SEQ ID NO:442);
TTGAGCCAGA (SEQ ID NO:444);
GAGCCAGATA (SEQ ID NO:446);
GCCAGATAAG (SEQ ID NO:448);
CAGATAAGGT (SEQ ID NO:450);



AACTACACAC (SEQ ID NO:377);
CTACACACCA (SEQ ID NO:379);
ACACACCAGG (SEQ ID NO:381);
ACACCAGGGC (SEQ ID NO:383);
ACCAGGGCCA (SEQ ID NO:385);
CAGGGCCAGG (SEQ ID NO:387);
GGGCCAGGGG (SEQ ID NO:389);
GCCAGGGGTC (SEQ ID NO:391);
CAGGGGTCAG (SEQ ID NO:393);
GGGGTCAGAT (SEQ ID NO:395);
GGTCAGATAT (SEQ ID NO:397);
TCAGATATCC (SEQ ID NO:399);
AGATATCCAC (SEQ ID NO:401);
ATATCCACTG (SEQ ID NO:403);
ATCCACTGAC (SEQ ID NO:405);
CCACTGACCT (SEQ ID NO:407);
ACTGACCTTT (SEQ ID NO:409);
TGACCTTTGG (SEQ ID NO:411);
ACCTTTGGAT (SEQ ID NO:413);
CTTTGGATGG (SEQ ID NO:415);
TTGGATGGTG (SEQ ID NO:417);
GGATGGTGCT (SEQ ID NO:419);
ATGGTGCTAC (SEQ ID NO:421);
GGTGCTACAA (SEQ ID NO:423);
TGCTACAAGC (SEQ ID NO:425);
CTACAAGCTA (SEQ ID NO:427);
ACAAGCTAGT (SEQ ID NO:429);
AAGCTAGTAC (SEQ ID NO:431);
GCTAGTACCA (SEQ ID NO:433);
TAGTACCAGT (SEQ ID NO:435);
GTACCAGTTG (SEQ ID NO:437);
ACCAGTTGAG (SEQ ID NO:439);
CAGTTGAGCC (SEQ ID NO:441);
GTTGAGCCAG (SEQ ID NO:443);
TGAGCCAGAT (SEQ ID NO:445);
AGCCAGATAA (SEQ ID NO:447);
CCAGATAAGG (SEQ ID NO:449);
AGATAAGGTA (SEQ ID NO:451);
GATAAGGTAG (SEQ ID NO:452);
TAAGGTAGAA (SEQ ID NO:454);
AGGTAGAAGA (SEQ ID NO:456);
GTAGAAGAGG (SEQ ID NO:458);
AGAAGAGGCC (SEQ ID NO:460);
AAGAGGCCAA (SEQ ID NO:462);
GAGGCCAATA (SEQ ID NO:464);
GGCCAATAAA (SEQ ID NO:466);
CCAATAAAGG (SEQ ID NO:468);
AATAAAGGAG (SEQ ID NO:470);
TAAAGGAGAG (SEQ ID NO:472);
AAGGAGAGAA (SEQ ID NO:474);
GGAGAGAACA (SEQ ID NO:476);
AGAGAACACC (SEQ ID NO:478);
AGAACACCAG (SEQ ID NO:480);
AACACCAGCT (SEQ ID NO:482);
CACCAGCTTG (SEQ ID NO:484);
CCAGCTTGTT (SEQ ID NO:486);
AGCTTGTTAC (SEQ ID NO:488);
CTTGTTACAC (SEQ ID NO:490);
TGTTACACCC (SEQ ID NO:492);
TTACACCCTG (SEQ ID NO:494);
ACACCCTGTG (SEQ ID NO:496);
ACCCTGTGAG (SEQ ID NO:498);
CCTGTGAGCC (SEQ ID NO:500);
TGTGAGCCTG (SEQ ID NO:502);
TGAGCCTGCA (SEQ ID NO:504);
AGCCTGCATG (SEQ ID NO:506);
CCTGCATGGA (SEQ ID NO:508);
TGCATGGAAT (SEQ ID NO:510);
CATGGAATGG (SEQ ID NO:512);
TGGAATGGAT (SEQ ID NO:514);
GAATGGATGA (SEQ ID NO:516);
ATGGATGACC (SEQ ID NO:518);
GGATGACCCT (SEQ ID NO:520);
ATGACCCTGA (SEQ ID NO:522);
GACCCTGAGA (SEQ ID NO:524);
CCCTGAGAGA (SEQ ID NO:526);
ATAAGGTAGA (SEQ ID NO:453);
AAGGTAGAAG (SEQ ID NO:455);
GGTAGAAGAG (SEQ ID NO:457);
TAGAAGAGGC (SEQ ID NO:459);
GAAGAGGCCA (SEQ ID NO:461);
AGAGGCCAAT (SEQ ID NO:463);
AGGCCAATAA (SEQ ID NO:465);
GCCAATAAAG (SEQ ID NO:467);
CAATAAAGGA (SEQ ID NO:469);
ATAAAGGAGA (SEQ ID NO:471);
AAAGGAGAGA (SEQ ID NO:473);
AGGAGAGAAC (SEQ ID NO:475);
GAGAGAACAC (SEQ ID NO:477);
GAGAACACCA (SEQ ID NO:479);
GAACACCAGC (SEQ ID NO:481);
ACACCAGCTT (SEQ ID NO:483);
ACCAGCTTGT (SEQ ID NO:485);
CAGCTTGTTA (SEQ ID NO:487);
GCTTGTTACA (SEQ ID NO:489);
TTGTTACACC (SEQ ID NO:491);
GTTACACCCT (SEQ ID NO:493);
TACACCCTGT (SEQ ID NO:495);
CACCCTGTGA (SEQ ID NO:497);
CCCTGTGAGC (SEQ ID NO:499);
CTGTGAGCCT (SEQ ID NO:501);
GTGAGCCTGC (SEQ ID NO:503);
GAGCCTGCAT (SEQ ID NO:505);
GCCTGCATGG (SEQ ID NO:507);
CTGCATGGAA (SEQ ID NO:509);
GCATGGAATG (SEQ ID NO:511);
ATGGAATGGA (SEQ ID NO:513);
GGAATGGATG (SEQ ID NO:515);
AATGGATGAC (SEQ ID NO:517);
TGGATGACCC (SEQ ID NO:519);
GATGACCCTG (SEQ ID NO:521);
TGACCCTGAG (SEQ ID NO:523);
ACCCTGAGAG (SEQ ID NO:525);
CCTGAGAGAG (SEQ ID NO:527);
CTGAGAGAGA (SEQ ID NO:528);
GAGAGAGAAG (SEQ ID NO:530);
GAGAGAAGTG (SEQ ID NO:532);
GAGAAGTGTT (SEQ ID NO:534);
GAAGTGTTAG (SEQ ID NO:536);
AGTGTTAGAG (SEQ ID NO:538);
TGTTAGAGTG (SEQ ID NO:540);
TTAGAGTGGA (SEQ ID NO:542);
AGAGTGGAGG (SEQ ID NO:544);
AGTGGAGGTT (SEQ ID NO:546);
TGGAGGTTTG (SEQ ID NO:548);
GAGGTTTGAC (SEQ ID NO:550);
GGTTTGACAG (SEQ ID NO:552);
TTTGACAGCC (SEQ ID NO:554);
TGACAGCCGC (SEQ ID NO:556);
ACAGCCGCCT (SEQ ID NO:558);
AGCCGCCTAG (SEQ ID NO:560);
CCGCCTAGCA (SEQ ID NO:562);
GCCTAGCATT (SEQ ID NO:564);
CTAGCATTTC (SEQ ID NO:566);
AGCATTTCAT (SEQ ID NO:568);
CATTTCATCA (SEQ ID NO:570);
TTTCATCACG (SEQ ID NO:572);
TCATCACGTG (SEQ ID NO:574);
ATCACGTGGC (SEQ ID NO:576);
CACGTGGCCC (SEQ ID NO:578);
CGTGGCCCGA (SEQ ID NO:580);
TGGCCCGAGA (SEQ ID NO:582);
GCCCGAGAGC (SEQ ID NO:584);
CCGAGAGCTG (SEQ ID NO:586);
GAGAGCTGCA (SEQ ID NO:588);
GAGCTGCATC (SEQ ID NO:590);
GCTGCATCCG (SEQ ID NO:592);
TGCATCCGGA (SEQ ID NO:594);
CATCCGGAGT (SEQ ID NO:596);
TCCGGAGTAC (SEQ ID NO:598);
CGGAGTACTT (SEQ ID NO:600);
GAGTACTTCA (SEQ ID NO:602);
TGAGAGAGAA (SEQ ID NO:529);
AGAGAGAAGT (SEQ ID NO:531);
AGAGAAGTGT (SEQ ID NO:533);
AGAAGTGTTA (SEQ ID NO:535);
AAGTGTTAGA (SEQ ID NO:537);
GTGTTAGAGT (SEQ ID NO:539);
GTTAGAGTGG (SEQ ID NO:541);
TAGAGTGGAG (SEQ ID NO:543);
GAGTGGAGGT (SEQ ID NO:545);
GTGGAGGTTT (SEQ ID NO:547);
GGAGGTTTGA (SEQ ID NO:549);
AGGTTTGACA (SEQ ID NO:551);
GTTTGACAGC (SEQ ID NO:553);
TTGACAGCCG (SEQ ID NO:555);
GACAGCCGCC (SEQ ID NO:557);
CAGCCGCCTA (SEQ ID NO:559);
GCCGCCTAGC (SEQ ID NO:561);
CGCCTAGCAT (SEQ ID NO:563);
CCTAGCATTT (SEQ ID NO:565);
TAGCATTTCA (SEQ ID NO:567);
GCATTTCATC (SEQ ID NO:569);
ATTTCATCAC (SEQ ID NO:571);
TTCATCACGT (SEQ ID NO:573);
CATCACGTGG (SEQ ID NO:575);
TCACGTGGCC (SEQ ID NO:577);
ACGTGGCCCG (SEQ ID NO:579);
GTGGCCCGAG (SEQ ID NO:581);
GGCCCGAGAG (SEQ ID NO:583);
CCCGAGAGCT (SEQ ID NO:585);
CGAGAGCTGC (SEQ ID NO:587);
AGAGCTGCAT (SEQ ID NO:589);
AGCTGCATCC (SEQ ID NO:591);
CTGCATCCGG (SEQ ID NO:593);
GCATCCGGAG (SEQ ID NO:595);
ATCCGGAGTA (SEQ ID NO:597);
CCGGAGTACT (SEQ ID NO:599);
GGAGTACTTC (SEQ ID NO:601);
AGTACTTCAA (SEQ ID NO:603);
GTACTTCAAG (SEQ ID NO:604);
TACTTCAAGA (SEQ ID NO:605);
ACTTCAAGAA (SEQ ID NO:606);
CTTCAAGAAC (SEQ ID NO:607);
TTCAAGAACT (SEQ ID NO:608);
TCAAGAACTG (SEQ ID NO:609);
CAAGAACTGC (SEQ ID NO:610);
AAGAACTGCT (SEQ ID NO:611);
AGAACTGCTG (SEQ ID NO:612);
GAACTGCTGA (SEQ ID NO:613).

63. A method according to claim 57 wherein said deletion encompasses at least one
of the following decanucleotides from the nef gene:
ACCAGCTTGT [SEQ ID NO:803] CCAGCTTGTT [SEQ ID NO:804]
CAGCTTGTTA [SEQ ID NO:805] AGCTTGTTAC [SEQ ID NO:806]
GCTTGTTACA [SEQ ID NO:807] CTTGTTACAC [SEQ ID NO:808]
TTGTTACACC [SEQ ID NO:809] TGTTACACCC [SEQ ID NO:810]
GTTACACCCT [SEQ ID NO:811] TTACACCCTG [SEQ ID NO:812]
TACACCCTGT [SEQ ID NO:813] ACACCCTGTG [SEQ ID NO:814]
CACCCTGTGA [SEQ ID NO:815] ACCCTGTGAG [SEQ ID NO:816]
CCCTGTGAGC [SEQ ID NO:817] CCTGTGAGCC [SEQ ID NO:818]
CTGTGAGCCT [SEQ ID NO:819] TGTGAGCCTG [SEQ ID NO:820]
GTGAGCCTGC [SEQ ID NO:821] TGAGCCTGCA [SEQ ID NO:822]
GAGCCTGCAT [SEQ ID NO:823] AGCCTGCATG [SEQ ID NO:824]
GCCTGCATGG [SEQ ID NO:825] CCTGCATGGA [SEQ ID NO:826]
CTGCATGGAA [SEQ ID NO:827] TGCATGGAAT [SEQ ID NO:828]
GCATGGAATG [SEQ ID NO:829] CATGGAATGG [SEQ ID NO:830]
ATGGAATGGA [SEQ ID NO:831] TGGAATGGAT [SEQ ID NO:832]
GGAATGGATG [SEQ ID NO:833] GAATGGATGA [SEQ ID NO:834]
AATGGATGAC [SEQ ID NO:835] ATGGATGACC [SEQ ID NO:836]
TGGATGACCC [SEQ ID NO:837] GGATGACCCT [SEQ ID NO:838]
GATGACCCTG [SEQ ID NO:839] ATGACCCTGA [SEQ ID NO:840]
TGACCCTGAG [SEQ ID NO:841]



64. A method according to claim 57 wherein said deletion encompasses at least one
of the following decanucleotides from the LTR region of HIV-1 NL4-3:

GCTTTTTGCC (SEQ ID NO:652); CTTTTTGCCT (SEQ ID NO:653);
TTTTTGCCTG (SEQ ID NO:654); TTTTGCCTGT (SEQ ID NO:655);
TTTGCCTGTA (SEQ ID NO:656); TTGCCTGTAC (SEQ ID NO:657);
TGCCTGTACT (SEQ ID NO:658); GCCTGTACTG (SEQ ID NO:659);
CCTGTACTGG (SEQ ID NO:660); CTGTACTGGG (SEQ ID NO:661);
TGTACTGGGT (SEQ ID NO:662); GTACTGGGTC (SEQ ID NO:663);

TACTGGGTCT (SEQ ID NO:664);
CTGGGTCTCT (SEQ ID NO:666);
GGGTCTCTCT (SEQ ID NO:668);
GTCTCTCTGG (SEQ ID NO:670);
CTCTCTGGTT (SEQ ID NO:672);
CTCTGGTTAG (SEQ ID NO:674);
CTGGTTAGAC (SEQ ID NO:676);
GGTTAGACCA (SEQ ID NO:678);
TTAGACCAGA (SEQ ID NO:680);
AGACCAGATC (SEQ ID NO:682);
ACCAGATCTG (SEQ ID NO:684);
CAGATCTGAG (SEQ ID NO:686);
GATCTGAGCC (SEQ ID NO:688);
TCTGAGCCTG (SEQ ID NO:690);
TGAGCCTGGG (SEQ ID NO:692);
AGCCTGGGAG (SEQ ID NO:694);
CCTGGGAGCT (SEQ ID NO:696);
TGGGAGCTCT (SEQ ID NO:698);
GGAGCTCTCT (SEQ ID NO:700);
AGCTCTCTGG (SEQ ID NO:702);
CTCTCTGGCT (SEQ ID NO:704);
CTCTGGCTAA (SEQ ID NO:706);
CTGGCTAACT (SEQ ID NO:708);
GGCTAACTAG (SEQ ID NO:710);
CTAACTAGGG (SEQ ID NO:712);
AACTAGGGAA (SEQ ID NO:714);
CTAGGGAACC (SEQ ID NO:716);
AGGGAACCCA (SEQ ID NO:718);
GGAACCCACT (SEQ ID NO:720);
AACCCACTGC (SEQ ID NO:722);
CCCACTGCTT (SEQ ID NO:724);
CACTGCTTAA (SEQ ID NO:726);
CTGCTTAAGC (SEQ ID NO:728);
GCTTAAGCCT (SEQ ID NO:730);
TTAAGCCTCA (SEQ ID NO:732);
AAGCCTCAAT (SEQ ID NO:734);
GCCTCAATAA (SEQ ID NO:736);
CTCAATAAAG (SEQ ID NO:738);



ACTGGGTCTC (SEQ ID NO:665);
TGGGTCTCTC (SEQ ID NO:667);
GGTCTCTCTG (SEQ ID NO:669);
TCTCTCTGGT (SEQ ID NO:671);
TCTCTGGTTA (SEQ ID NO:673);
TCTGGTTAGA (SEQ ID NO:675);
TGGTTAGACC (SEQ ID NO:677);
GTTAGACCAG (SEQ ID NO:679);
TAGACCAGAT (SEQ ID NO:681);
GACCAGATCT (SEQ ID NO:683);
CCAGATCTGA (SEQ ID NO:685);
AGATCTGAGC (SEQ ID NO:687);
ATCTGAGCCT (SEQ ID NO:689);
CTGAGCCTGG (SEQ ID NO:691);
GAGCCTGGGA (SEQ ID NO:693);
GCCTGGGAGC (SEQ ID NO:695);
CTGGGAGCTC (SEQ ID NO:697);
GGGAGCTCTC (SEQ ID NO:699);
GAGCTCTCTG (SEQ ID NO:701);
GCTCTCTGGC (SEQ ID NO:703);
TCTCTGGCTA (SEQ ID NO:705);
TCTGGCTAAC (SEQ ID NO:707);
TGGCTAACTA (SEQ ID NO:709);
GCTAACTAGG (SEQ ID NO:711);
TAACTAGGGA (SEQ ID NO:713);
ACTAGGGAAC (SEQ ID NO:715);
TAGGGAACCC (SEQ ID NO:717);
GGGAACCCAC (SEQ ID NO:719);
GAACCCACTG (SEQ ID NO:721);
ACCCACTGCT (SEQ ID NO:723);
CCACTGCTTA (SEQ ID NO:725);
ACTGCTTAAG (SEQ ID NO:727);
TGCTTAAGCC (SEQ ID NO:729);
CTTAAGCCTC (SEQ ID NO:731);
TAAGCCTCAA (SEQ ID NO:733);
AGCCTCAATA (SEQ ID NO:735);
CCTCAATAAA (SEQ ID NO:737);
TCAATAAAGC (SEQ ID NO:739);

CAATAAAGCT (SEQ ID NO:740); AATAAAGCTT (SEQ ID NO:741);
ATAAAGCTTG (SEQ ID NO:742); TAAAGCTTGC (SEQ ID NO:743);
AAAGCTTGCC (SEQ ID NO:744); AAGCTTGCCT (SEQ ID NO:745);
AGCTTGCCTT (SEQ ID NO:746); GCTTGCCTTG (SEQ ID NO:747);
CTTGCCTTGA (SEQ ID NO:748); TTGCCTTGAG (SEQ ID NO:749);
TGCCTTGAGT (SEQ ID NO:750); GCCTTGAGTG (SEQ ID NO:751);
CCTTGAGTGC (SEQ ID NO:752); CTTGAGTGCT (SEQ ID NO:753);
TTGAGTGCTT (SEQ ID NO:754); TGAGTGCTTC (SEQ ID NO:755);
GAGTGCTTCA (SEQ ID NO:756); AGTGCTTCAA (SEQ ID NO:757);
GTGCTTCAAG (SEQ ID NO:758); TGCTTCAAGT (SEQ ID NO:759);



GCTTCAAGTA (SEQ ID NO:760); CTTCAAGTAG (SEQ ID NO:761);
TTCAAGTAGT (SEQ ID NO:762); TCAAGTAGTG (SEQ ID NO:763);
CAAGTAGTGT (SEQ ID NO:764); AAGTAGTGTG (SEQ ID NO:765);
AGTAGTGTGT (SEQ ID NO:766); GTAGTGTGTG (SEQ ID NO:767);
TAGTGTGTGC (SEQ ID NO:768); AGTGTGTGCC (SEQ ID NO:769);
GTGTGTGCCC (SEQ ID NO:770); TGTGTGCCCG (SEQ ID NO:771);
GTGTGCCCGT (SEQ ID NO:772); TGTGCCCGTC (SEQ ID NO:773);
GTGCCCGTCT (SEQ ID NO:774); TGCCCGTCTG (SEQ ID NO:775);
GCCCGTCTGT (SEQ ID NO:776); CCCGTCTGTT (SEQ ID NO:777);
CCGTCTGTTG (SEQ ID NO:778); CGTCTGTTGT (SEQ ID NO:779);
GTCTGTTGTG (SEQ ID NO:780); TCTGTTGTGT (SEQ ID NO:781);
CTGTTGTGTG (SEQ ID NO:782); TGTTGTGTGA (SEQ ID NO:783);
GTTGTGTGAC (SEQ ID NO:784); TTGTGTGACT (SEQ ID NO:785);
TGTGTGACTC (SEQ ID NO:786); GTGTGACTCT (SEQ ID NO:787);
TGTGTGACTC (SEQ ID NO:788); GTGTGACTCT (SEQ ID NO:789);
TGTGACTCTG (SEQ ID NO:790); GTGACTCTGG (SEQ ID NO:791);
TGACTCTGGT (SEQ ID NO:792); GACTCTGGTA (SEQ ID NO:793);
ACTCTGGTAA (SEQ ID NO:794); CTCTGGTAAC (SEQ ID NO:795);
TCTGGTAACT (SEQ ID NO:796); CTGGTAACTA (SEQ ID NO:797);
TGGTAACTAG (SEQ ID NO:798); GGTAACTAGA (SEQ ID NO:799).



65. A method according to claim 49 wherein said HIV-1 isolate has the
characteristics of HIV-1 isolate ECACC Accession No. V94101706 or HIV-1 isolate
Accession No. ECACC V941031169 or HIV-1 isolate ECACC Accession No.
V95031022.

66. A method according to claim 49 wherein the HIV-1 isolate is reactive to
antibodies to a glycoprotein of HIV-1, is capable of inducing an immune response to
gag, pol and/or env and which is incapable of directing synthesis of a nef gene product
or a full length nef gene product.

67. A method according to claim 66 wherein the glycoprotein is at least one of gp41-
45, gp120 and/or gp160.

68. A method for obtaining a preparation of non-pathogenic HIV-1 from a biological
sample, said method comprising co-culturing PBMCs from said biological sample from
an individual putatively infected with said non-pathogenic HIV-1 with HIV-1
seronegative donor PBMCs depleted for CD8+ cells, harvesting the PBMCs and
supernatant fluid every from about 5 to about 10 days and adding fresh medium with
CD8+ depleted PBMCs with said fresh medium and isolating said virus from the
supernatant fluid.

69. A method according to claim 68 wherein the PBMCs are stimulated
with a cytokine for a time and under conditions sufficient to enhance HIV-1 replication.

70. A method according to claim 69 wherein the cytokine is M-CSF.

71. A method according to claim 68 or 69 or 70 wherein the HIV-1 seronegative
donor PBMCs are first stimulated with PHA and/or IL-2.

72. A method according to claim 68 wherein the PBMCs from the putatively infected
individual are also CD8+ depleted prior to co-culturing with seronegative donor PBMCs.

73. A method according to claim 68 or 72 wherein the co-cultured cells are subjected
to UV irradiation.




74. A method for obtaining a preparation of non-pathogenic HIV-1 from a biological
sample, said method comprising co-culturing monocytes from said biological sample
from an individual putatively infected with said non-pathogenic HIV-1 with HIV-1
seronegative donor PBMCs depleted for CD8+ cells, harvesting the monocytes and
PBMCs and supernatant fluid every from about 5 to about 10 days and adding fresh
medium with CD8+ depleted PBMCs with said fresh medium and isolating said virus
from the supernatant fluid.

75. A method according to claim 74 wherein the monocytes are stimulated
with a cytokine for a time and under conditions sufficient to enhance HIV-1 replication.

76. A method according to claim 75 wherein the cytokine is M-CSF.

77. A method according to claim 74 or 75 or 76 wherein the HIV-1 seronegative
donor PBMCs are first stimulated with PHA and/or IL-2.

78. A method according to claim 74 wherein the monocytes from a putatively infected
individual are also CD8+ depleted prior to co-culturing with seronegative donor PBMCs.

79. A method according to claim 74 or 78 wherein the co-cultured cells are subjected
to UV irradiation.

80. A method for identifying or screening for compounds capable of reducing or
otherwise interfering with HIV-1 replication, said method comprising contacting a
compound to be tested with a cell or cell extract containing or capable of containing a
nef gene product fused to a reporter molecule capable of giving an identifiable signal
and screening for a compound which inhibits said signal.

81. A method according to claim 80 wherein the compound is an antibody to nef
gene product or a part thereof.

82. A method according to claim 80 wherein the compound is [illegible]
and inhibits transcription of the nef gene.

83. A compound capable of inhibiting nef gene product function or nef gene
transcription.

84. A compound capable of inhibiting nef gene product function or nef gene
transcription identified according to the method of claim 80 or 81 or 82.

85. A therapeutic composition for [illegible]
by a pathogenic strain of HIV-1 and/or for vaccinating an individual against the
development of AIDS or AIDS-related diseases, said composition comprising a
non-pathogenic strain of HIV-1 according to any one of claims 1 to 1[illegible] and one or more
pharmaceutically acceptable carriers and/or diluents.

86. A therapeutic composition according to claim 85 wherein said HIV-1 isolate
comprises [illegible] nucleotide
sequences which inhibit production of one or more proteins encoded by a pathogenic
strain of HIV-1.

87. A viral isolate which:

(i) is genetically or immunologically related to a pathogenic strain of HIV-1;

(ii) is substantially non-pathogenic in human subjects;

(iii) comprises a first nucleotide sequence constituting its genome which is
capable of hybridising under medium stringency conditions to SEQ ID
NO:1 or a complementary form thereof; and

(iv) comprises a second nucleotide sequence within said first nucleotide
sequence and which second nucleotide sequence directs expression of a
mRNA molecule capable of inhibiting, reducing or otherwise down-
regulating translation of a protein or polypeptide encoded by a pathogenic
strain of HIV-1 or inhibit, reduce or otherwise down regulate operation
of a non-protein encoding region of a pathogenic strain of HIV-1.

88. A viral isolate according to claim 87 wherein the targeted protein or polypeptide
is selected from the list consisting of [illegible].

89. A viral isolate according to claim 87 wherein the targeted protein or polypeptide
is selected from the list consisting of [illegible].

90. A viral isolate according to claim 87 wherein the targeted protein or polypeptide
is selected from the list consisting of [illegible].

91. A viral isolate according to claim 87 wherein the targeted protein or polypeptide
is selected from the list consisting of [illegible].

92. A viral isolate according to claim 87 wherein the targeted protein is nef.

93. A viral isolate according to claim 87 comprising a genome which substantially
does not direct synthesis of amino acids 162 to 177 of Nef.
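
Purely as an illustration of the coordinates in the preceding claim: if codons are counted from the first nucleotide of the nef open reading frame, amino acids 162 to 177 span nucleotides 484 to 531 of that reading frame. The sketch below does only this arithmetic. The function name is hypothetical, and 1-based inclusive numbering relative to the start codon is an assumption rather than something the claims state.

```python
def codon_range_to_nt(first_aa: int, last_aa: int) -> tuple[int, int]:
    """Map an inclusive amino-acid range to the inclusive nucleotide range
    of the same open reading frame (assumption: both are 1-based and the
    reading frame starts at nucleotide 1)."""
    if first_aa < 1 or last_aa < first_aa:
        raise ValueError("expected 1 <= first_aa <= last_aa")
    start_nt = (first_aa - 1) * 3 + 1  # first base of the first codon
    end_nt = last_aa * 3               # last base of the last codon
    return start_nt, end_nt


print(codon_range_to_nt(162, 177))  # (484, 531)
```

Converting these reading-frame positions to genome coordinates would additionally require the genomic position of the start codon, which this sketch deliberately leaves out.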

94. A method for determining the pathogenicity of an HIV-1 strain after said HIV-1
strain infects cells of an individual, said method comprising determining the presence
of a deletion mutation in the genome of said HIV-1 wherein said deletion mutation
results in said genome being unable to synthesise a polypeptide or protein from a
pathogenic strain of HIV-1 or directing the synthesis of a truncated form of said
polypeptide or protein wherein the presence of such a mutation is indicative of the
presence of a non-pathogenic strain of HIV-1.

95. A method according to claim 94 wherein said non-pathogenic HIV-1 carries a
deletion in its genome of at least 10 nucleotides within the region from nucleotide 8787
to nucleotide 9709 using the nucleotide numbering of HIV-1 NL4-3.
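
One possible reading of the "deletion of at least 10 nucleotides within a region" test used in this and the following claims can be sketched as a simple interval check. All names below are hypothetical, coordinates are taken as 1-based and inclusive, and treating "within" as full containment is an assumption. The example region 9105-9224 is one of those recited in the later claims.

```python
from typing import NamedTuple


class Interval(NamedTuple):
    start: int  # first nucleotide, 1-based inclusive
    end: int    # last nucleotide, 1-based inclusive


def is_qualifying_deletion(deletion: Interval, region: Interval,
                           min_length: int = 10) -> bool:
    """Return True if the deletion removes at least `min_length` nucleotides
    and lies entirely inside `region` (assumed meaning of "within")."""
    length = deletion.end - deletion.start + 1
    inside = region.start <= deletion.start and deletion.end <= region.end
    return length >= min_length and inside


region = Interval(9105, 9224)
print(is_qualifying_deletion(Interval(9110, 9125), region))  # True: 16 nt, inside
print(is_qualifying_deletion(Interval(9110, 9115), region))  # False: only 6 nt
```

A looser reading, in which the deletion only needs to overlap the region by 10 nucleotides, would replace the containment test with an overlap-length test.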






96. A method according to claim 94 wherein said non-pathogenic HIV-1 carries a
deletion in its genome of at least 10 nucleotides from within a region selected from the
list consisting of:

nucleotide (0 SS30-tM2; 

Cu) 900»4035: 
(iii) 9019U9029: and 

(Iv) 903S-9049. 



97. A method according to claim 94 wherein said non-pathogenic HIV-1 carries a
deletion in its genome of at least 10 nucleotides from within a region selected from the
list consisting of:

nucleotide (v) 92tl-«S71; 

(vi) 92I1-«SC2: 
(viO 9105-9224; and 

(viiO 9271««)7D. 



98. A method according to claim 94 wherein said non-pathogenic HIV-1 carries a
deletion in its genome of at least 10 nucleotides from within a region selected from the
list consisting of:

meieotide Ox) ttt2492t: 
(x) SSS04006: 
(n) S792-9041; and 

(xii) 9112^9204. 



99. A method according to claim 94 wherein said non-pathogenic HIV-1 carries a
deletion in its genome of at least 10 nucleotides from within a region selected from the
list consisting of:

nucleotide (xiii) 9105-9224;

(xiv) 9389-9395; and

(xv) 9281-9366.

100. A method according to claim 94 wherein said deletion encompasses at least one
of the following decanucleotides from the nef gene of HIV-1 NL4-3:





ATGGGTGGCA (SEQ ID NO:2); TGGGTGGCAA (SEQ ID NO:3);
GGGTGGCAAG (SEQ ID NO:4); GGTGGCAAGT (SEQ ID NO:5);
GTGGCAAGTG (SEQ ID NO:6); TGGCAAGTGG (SEQ ID NO:7);
GGCAAGTGGT (SEQ ID NO:8); GCAAGTGGTC (SEQ ID NO:9);
CAAGTGGTCA (SEQ ID NO:10); AAGTGGTCAA (SEQ ID NO:11);
AGTGGTCAAA (SEQ ID NO:12); GTGGTCAAAA (SEQ ID NO:13);
TGGTCAAAAA (SEQ ID NO:14); GGTCAAAAAG (SEQ ID NO:15);
GTCAAAAAGT (SEQ ID NO:16); TCAAAAAGTA (SEQ ID NO:17);
CAAAAAGTAG (SEQ ID NO:18); AAAAAGTAGT (SEQ ID NO:19);
AAAAGTAGTG (SEQ ID NO:20); AAAGTAGTGT (SEQ ID NO:21);
AAGTAGTGTG (SEQ ID NO:22); AGTAGTGTGA (SEQ ID NO:23);
GTAGTGTGAT (SEQ ID NO:24); TAGTGTGATT (SEQ ID NO:25);
AGTGTGATTG (SEQ ID NO:26); GTGTGATTGG (SEQ ID NO:27);
TGTGATTGGA (SEQ ID NO:28); GTGATTGGAT (SEQ ID NO:29);
TGATTGGATG (SEQ ID NO:30); GATTGGATGG (SEQ ID NO:31);
ATTGGATGGC (SEQ ID NO:32); TTGGATGGCC (SEQ ID NO:33);
TGGATGGCCT (SEQ ID NO:34); GGATGGCCTG (SEQ ID NO:35);
GATGGCCTGC (SEQ ID NO:36); ATGGCCTGCT (SEQ ID NO:37);
TGGCCTGCTG (SEQ ID NO:38); GGCCTGCTGT (SEQ ID NO:39);
GCCTGCTGTA (SEQ ID NO:40); CCTGCTGTAA (SEQ ID NO:41);
CTGCTGTAAG (SEQ ID NO:42); TGCTGTAAGG (SEQ ID NO:43);
GCTGTAAGGG (SEQ ID NO:44); CTGTAAGGGA (SEQ ID NO:45);
TGTAAGGGAA (SEQ ID NO:46); GTAAGGGAAA (SEQ ID NO:47);
TAAGGGAAAG (SEQ ID NO:48); AAGGGAAAGA (SEQ ID NO:49);
AGGGAAAGAA (SEQ ID NO:50); GGGAAAGAAT (SEQ ID NO:51);
GGAAAGAATG (SEQ ID NO:52); GAAAGAATGA (SEQ ID NO:53);
AAAGAATGAG (SEQ ID NO:54); AAGAATGAGA (SEQ ID NO:55);
AGAATGAGAC (SEQ ID NO:56); GAATGAGACG (SEQ ID NO:57);
AATGAGACGA (SEQ ID NO:58); ATGAGACGAG (SEQ ID NO:59);
TGAGACGAGC (SEQ ID NO:60); GAGACGAGCT (SEQ ID NO:61);
AGACGAGCTG (SEQ ID NO:62); GACGAGCTGA (SEQ ID NO:63);
ACGAGCTGAG (SEQ ID NO:64); CGAGCTGAGC (SEQ ID NO:65);
GAGCTGAGCC (SEQ ID NO:66); AGCTGAGCCA (SEQ ID NO:67);
GCTGAGCCAG (SEQ ID NO:68); CTGAGCCAGC (SEQ ID NO:69);
TGAGCCAGCA (SEQ ID NO:70); GAGCCAGCAG (SEQ ID NO:71);

AGCCAGCAGC (SEQ ID NO:72);
CCAGCAGCAG (SEQ ID NO:74);
AGCAGCAGAT (SEQ ID NO:76);
CAGCAGATGG (SEQ ID NO:78);
GCAGATGGGG (SEQ ID NO:80);
AGATGGGGTG (SEQ ID NO:82);
ATGGGGTGGG (SEQ ID NO:84);
GGGGTGGGAG (SEQ ID NO:86);
GGTGGGAGCA (SEQ ID NO:88);
TGGGAGCAGT (SEQ ID NO:90);
GGAGCAGTAT (SEQ ID NO:92);
AGCAGTATCT (SEQ ID NO:94);
CAGTATCTCG (SEQ ID NO:96);
GTATCTCGAG (SEQ ID NO:98);
ATCTCGAGAC (SEQ ID NO:100);
CTCGAGACCT (SEQ ID NO:102);
CGAGACCTAG (SEQ ID NO:104);
AGACCTAGAA (SEQ ID NO:106);
ACCTAGAAAA (SEQ ID NO:108);
CTAGAAAAAC (SEQ ID NO:110);
AGAAAAACAT (SEQ ID NO:112);
AAAAACATGG (SEQ ID NO:114);
AAACATGGAG (SEQ ID NO:116);
ACATGGAGCA (SEQ ID NO:118);
ATGGAGCAAT (SEQ ID NO:120);
GGAGCAATCA (SEQ ID NO:122);
AGCAATCACA (SEQ ID NO:124);
CAATCACAAG (SEQ ID NO:126);
ATCACAAGTA (SEQ ID NO:128);
CACAAGTAGC (SEQ ID NO:130);
CAAGTAGCAA (SEQ ID NO:132);
AGTAGCAATA (SEQ ID NO:134);
TAGCAATACA (SEQ ID NO:136);
GCAATACAGC (SEQ ID NO:138);
AATACAGCAG (SEQ ID NO:140);
TACAGCAGCT (SEQ ID NO:142);
CAGCAGCTAA (SEQ ID NO:144);
GCAGCTAACA (SEQ ID NO:146);







GCCAGCAGCA (SEQ ID NO:73);
CAGCAGCAGA (SEQ ID NO:75);
GCAGCAGATG (SEQ ID NO:77);
AGCAGATGGG (SEQ ID NO:79);
CAGATGGGGT (SEQ ID NO:81);
GATGGGGTGG (SEQ ID NO:83);
TGGGGTGGGA (SEQ ID NO:85);
GGGTGGGAGC (SEQ ID NO:87);
GTGGGAGCAG (SEQ ID NO:89);
GGGAGCAGTA (SEQ ID NO:91);
GAGCAGTATC (SEQ ID NO:93);
GCAGTATCTC (SEQ ID NO:95);
AGTATCTCGA (SEQ ID NO:97);
TATCTCGAGA (SEQ ID NO:99);
TCTCGAGACC (SEQ ID NO:101);
TCGAGACCTA (SEQ ID NO:103);
GAGACCTAGA (SEQ ID NO:105);
GACCTAGAAA (SEQ ID NO:107);
CCTAGAAAAA (SEQ ID NO:109);
TAGAAAAACA (SEQ ID NO:111);
GAAAAACATG (SEQ ID NO:113);
AAAACATGGA (SEQ ID NO:115);
AACATGGAGC (SEQ ID NO:117);
CATGGAGCAA (SEQ ID NO:119);
TGGAGCAATC (SEQ ID NO:121);
GAGCAATCAC (SEQ ID NO:123);
GCAATCACAA (SEQ ID NO:125);
AATCACAAGT (SEQ ID NO:127);
TCACAAGTAG (SEQ ID NO:129);
ACAAGTAGCA (SEQ ID NO:131);
AAGTAGCAAT (SEQ ID NO:133);
GTAGCAATAC (SEQ ID NO:135);
AGCAATACAG (SEQ ID NO:137);
CAATACAGCA (SEQ ID NO:139);
ATACAGCAGC (SEQ ID NO:141);
ACAGCAGCTA (SEQ ID NO:143);
AGCAGCTAAC (SEQ ID NO:145);
CAGCTAACAA (SEQ ID NO:147);

AGCTAACAAT (SEQ ID NO:148);
CTAACAATGC (SEQ ID NO:150);
AACAATGCTG (SEQ ID NO:152);
CAATGCTGCT (SEQ ID NO:154);
ATGCTGCTTG (SEQ ID NO:156);
GCTGCTTGTG (SEQ ID NO:158);
TGCTTGTGCC (SEQ ID NO:160);
CTTGTGCCTG (SEQ ID NO:162);
TGTGCCTGGC (SEQ ID NO:164);
TGCCTGGCTA (SEQ ID NO:166);
CCTGGCTAGA (SEQ ID NO:168);
TGGCTAGAAG (SEQ ID NO:170);
GCTAGAAGCA (SEQ ID NO:172);
TAGAAGCACA (SEQ ID NO:174);
GAAGCACAAG (SEQ ID NO:176);
AGCACAAGAG (SEQ ID NO:178);
CACAAGAGGA (SEQ ID NO:180);
CAAGAGGAGG (SEQ ID NO:182);
AGAGGAGGAA (SEQ ID NO:184);
AGGAGGAAGA (SEQ ID NO:186);
GAGGAAGAGG (SEQ ID NO:188);
GGAAGAGGTG (SEQ ID NO:190);
AAGAGGTGGG (SEQ ID NO:192);
GAGGTGGGTT (SEQ ID NO:194);
GGTGGGTTTT (SEQ ID NO:196);
TGGGTTTTCC (SEQ ID NO:198);
GGTTTTCCAG (SEQ ID NO:200);
TTTTCCAGTC (SEQ ID NO:202);
TTCCAGTCAC (SEQ ID NO:204);
CCAGTCACAC (SEQ ID NO:206);
AGTCACACCT (SEQ ID NO:208);
TCACACCTCA (SEQ ID NO:210);
ACACCTCAGG (SEQ ID NO:212);
ACCTCAGGTA (SEQ ID NO:214);
CTCAGGTACC (SEQ ID NO:216);
CAGGTACCTT (SEQ ID NO:218);
GGTACCTTTA (SEQ ID NO:220);
TACCTTTAAG (SEQ ID NO:222);




GCTAACAATG (SEQ ID NO:149);
TAACAATGCT (SEQ ID NO:151);
ACAATGCTGC (SEQ ID NO:153);
AATGCTGCTT (SEQ ID NO:155);
TGCTGCTTGT (SEQ ID NO:157);
CTGCTTGTGC (SEQ ID NO:159);
GCTTGTGCCT (SEQ ID NO:161);
TTGTGCCTGG (SEQ ID NO:163);
GTGCCTGGCT (SEQ ID NO:165);
GCCTGGCTAG (SEQ ID NO:167);
CTGGCTAGAA (SEQ ID NO:169);
GGCTAGAAGC (SEQ ID NO:171);
CTAGAAGCAC (SEQ ID NO:173);
AGAAGCACAA (SEQ ID NO:175);
AAGCACAAGA (SEQ ID NO:177);
GCACAAGAGG (SEQ ID NO:179);
ACAAGAGGAG (SEQ ID NO:181);
AAGAGGAGGA (SEQ ID NO:183);
GAGGAGGAAG (SEQ ID NO:185);
GGAGGAAGAG (SEQ ID NO:187);
AGGAAGAGGT (SEQ ID NO:189);
GAAGAGGTGG (SEQ ID NO:191);
AGAGGTGGGT (SEQ ID NO:193);
AGGTGGGTTT (SEQ ID NO:195);
GTGGGTTTTC (SEQ ID NO:197);
GGGTTTTCCA (SEQ ID NO:199);
GTTTTCCAGT (SEQ ID NO:201);
TTTCCAGTCA (SEQ ID NO:203);
TCCAGTCACA (SEQ ID NO:205);
CAGTCACACC (SEQ ID NO:207);
GTCACACCTC (SEQ ID NO:209);
CACACCTCAG (SEQ ID NO:211);
CACCTCAGGT (SEQ ID NO:213);
CCTCAGGTAC (SEQ ID NO:215);
TCAGGTACCT (SEQ ID NO:217);
AGGTACCTTT (SEQ ID NO:219);
GTACCTTTAA (SEQ ID NO:221);
ACCTTTAAGA (SEQ ID NO:223);

CCTTTAAGAC (SEQ ID NO:224);
TTTAAGACCA (SEQ ID NO:226);
TAAGACCAAT (SEQ ID NO:228);
AGACCAATGA (SEQ ID NO:230);
ACCAATGACT (SEQ ID NO:232);
CAATGACTTA (SEQ ID NO:234);
ATGACTTACA (SEQ ID NO:236);
GACTTACAAG (SEQ ID NO:238);
CTTACAAGGC (SEQ ID NO:240);
TACAAGGCAG (SEQ ID NO:242);
CAAGGCAGCT (SEQ ID NO:244);
AGGCAGCTGT (SEQ ID NO:246);
GCAGCTGTAG (SEQ ID NO:248);
AGCTGTAGAT (SEQ ID NO:250);
CTGTAGATCT (SEQ ID NO:252);
GTAGATCTTA (SEQ ID NO:254);
AGATCTTAGC (SEQ ID NO:256);
ATCTTAGCCA (SEQ ID NO:258);
CTTAGCCACT (SEQ ID NO:260);
TAGCCACTTT (SEQ ID NO:262);
GCCACTTTTT (SEQ ID NO:264);
CACTTTTTAA (SEQ ID NO:266);
CTTTTTAAAA (SEQ ID NO:268);
TTTTAAAAGA (SEQ ID NO:270);
TTAAAAGAAA (SEQ ID NO:272);
AAAAGAAAAG (SEQ ID NO:274);
AAGAAAAGGG (SEQ ID NO:276);
GAAAAGGGGG (SEQ ID NO:278);
AAAGGGGGGA (SEQ ID NO:280);
AGGGGGGACT (SEQ ID NO:282);
GGGGGACTGG (SEQ ID NO:284);
GGGACTGGAA (SEQ ID NO:286);
GACTGGAAGG (SEQ ID NO:288);
CTGGAAGGGC (SEQ ID NO:290);
GGAAGGGCTA (SEQ ID NO:292);
AAGGGCTAAT (SEQ ID NO:294);
GGGCTAATTC (SEQ ID NO:296);
GCTAATTCAC (SEQ ID NO:298);



CTTTAAGACC (SEQ ID NO:225);
TTAAGACCAA (SEQ ID NO:227);
AAGACCAATG (SEQ ID NO:229);
GACCAATGAC (SEQ ID NO:231);
CCAATGACTT (SEQ ID NO:233);
AATGACTTAC (SEQ ID NO:235);
TGACTTACAA (SEQ ID NO:237);
ACTTACAAGG (SEQ ID NO:239);
TTACAAGGCA (SEQ ID NO:241);
ACAAGGCAGC (SEQ ID NO:243);
AAGGCAGCTG (SEQ ID NO:245);
GGCAGCTGTA (SEQ ID NO:247);
CAGCTGTAGA (SEQ ID NO:249);
GCTGTAGATC (SEQ ID NO:251);
TGTAGATCTT (SEQ ID NO:253);
TAGATCTTAG (SEQ ID NO:255);
GATCTTAGCC (SEQ ID NO:257);
TCTTAGCCAC (SEQ ID NO:259);
TTAGCCACTT (SEQ ID NO:261);
AGCCACTTTT (SEQ ID NO:263);
CCACTTTTTA (SEQ ID NO:265);
ACTTTTTAAA (SEQ ID NO:267);
TTTTTAAAAG (SEQ ID NO:269);
TTTAAAAGAA (SEQ ID NO:271);
TAAAAGAAAA (SEQ ID NO:273);
AAAGAAAAGG (SEQ ID NO:275);
AGAAAAGGGG (SEQ ID NO:277);
AAAAGGGGGG (SEQ ID NO:279);
AAGGGGGGAC (SEQ ID NO:281);
GGGGGGACTG (SEQ ID NO:283);
GGGGACTGGA (SEQ ID NO:285);
GGACTGGAAG (SEQ ID NO:287);
ACTGGAAGGG (SEQ ID NO:289);
TGGAAGGGCT (SEQ ID NO:291);
GAAGGGCTAA (SEQ ID NO:293);
AGGGCTAATT (SEQ ID NO:295);
GGCTAATTCA (SEQ ID NO:297);
CTAATTCACT (SEQ ID NO:299);

TAATTCACTC (SEQ ID NO:300);
ATTCACTCCC (SEQ ID NO:302);
TCACTCCCAA (SEQ ID NO:304);
ACTCCCAAAG (SEQ ID NO:306);
TCCCAAAGAA (SEQ ID NO:308);
CCAAAGAAGA (SEQ ID NO:310);
AAAGAAGACA (SEQ ID NO:312);
AGAAGACAAG (SEQ ID NO:314);
AAGACAAGAT (SEQ ID NO:316);
GACAAGATAT (SEQ ID NO:318);
CAAGATATCC (SEQ ID NO:320);
AGATATCCTT (SEQ ID NO:322);
ATATCCTTGA (SEQ ID NO:324);
ATCCTTGATC (SEQ ID NO:326);
CCTTGATCTG (SEQ ID NO:328);
TTGATCTGTG (SEQ ID NO:330);
GATCTGTGGA (SEQ ID NO:332);
TCTGTGGATC (SEQ ID NO:334);
TGTGGATCTA (SEQ ID NO:336);
TGGATCTACC (SEQ ID NO:338);
GATCTACCAC (SEQ ID NO:340);
TCTACCACAC (SEQ ID NO:342);
TACCACACAC (SEQ ID NO:344);
CCACACACAA (SEQ ID NO:346);
ACACACAAGG (SEQ ID NO:348);
ACACAAGGCT (SEQ ID NO:350);
ACAAGGCTAC (SEQ ID NO:352);
AAGGCTACTT (SEQ ID NO:354);
GGCTACTTCC (SEQ ID NO:356);
CTACTTCCCT (SEQ ID NO:358);
ACTTCCCTGA (SEQ ID NO:360);
TTCCCTGATT (SEQ ID NO:362);
CCCTGATTGG (SEQ ID NO:364);
CTGATTGGCA (SEQ ID NO:366);
GATTGGCAGA (SEQ ID NO:368);
TTGGCAGAAC (SEQ ID NO:370);
GGCAGAACTA (SEQ ID NO:372);
CAGAACTACA (SEQ ID NO:374);




AATTCACTCC (SEQ ID NO:301);
TTCACTCCCA (SEQ ID NO:303);
CACTCCCAAA (SEQ ID NO:305);
CTCCCAAAGA (SEQ ID NO:307);
CCCAAAGAAG (SEQ ID NO:309);
CAAAGAAGAC (SEQ ID NO:311);
AAGAAGACAA (SEQ ID NO:313);
GAAGACAAGA (SEQ ID NO:315);
AGACAAGATA (SEQ ID NO:317);
ACAAGATATC (SEQ ID NO:319);
AAGATATCCT (SEQ ID NO:321);
GATATCCTTG (SEQ ID NO:323);
TATCCTTGAT (SEQ ID NO:325);
TCCTTGATCT (SEQ ID NO:327);
CTTGATCTGT (SEQ ID NO:329);
TGATCTGTGG (SEQ ID NO:331);
ATCTGTGGAT (SEQ ID NO:333);
CTGTGGATCT (SEQ ID NO:335);
GTGGATCTAC (SEQ ID NO:337);
GGATCTACCA (SEQ ID NO:339);
ATCTACCACA (SEQ ID NO:341);
CTACCACACA (SEQ ID NO:343);
ACCACACACA (SEQ ID NO:345);
CACACACAAG (SEQ ID NO:347);
CACACAAGGC (SEQ ID NO:349);
CACAAGGCTA (SEQ ID NO:351);
CAAGGCTACT (SEQ ID NO:353);
AGGCTACTTC (SEQ ID NO:355);
GCTACTTCCC (SEQ ID NO:357);
TACTTCCCTG (SEQ ID NO:359);
CTTCCCTGAT (SEQ ID NO:361);
TCCCTGATTG (SEQ ID NO:363);
CCTGATTGGC (SEQ ID NO:365);
TGATTGGCAG (SEQ ID NO:367);
ATTGGCAGAA (SEQ ID NO:369);
TGGCAGAACT (SEQ ID NO:371);
GCAGAACTAC (SEQ ID NO:373);
AGAACTACAC (SEQ ID NO:375);

GAACTACACA (SEQ ID NO:376); AACTACACAC (SEQ ID NO:377);
ACTACACACC (SEQ ID NO:378); CTACACACCA (SEQ ID NO:379);
TACACACCAG (SEQ ID NO:380); ACACACCAGG (SEQ ID NO:381);
CACACCAGGG (SEQ ID NO:382); ACACCAGGGC (SEQ ID NO:383);
CACCAGGGCC (SEQ ID NO:384); ACCAGGGCCA (SEQ ID NO:385);
CCAGGGCCAG (SEQ ID NO:386); CAGGGCCAGG (SEQ ID NO:387);
AGGGCCAGGG (SEQ ID NO:388); GGGCCAGGGG (SEQ ID NO:389);
GGCCAGGGGT (SEQ ID NO:390); GCCAGGGGTC (SEQ ID NO:391);
CCAGGGGTCA (SEQ ID NO:392); CAGGGGTCAG (SEQ ID NO:393);
AGGGGTCAGA (SEQ ID NO:394); GGGGTCAGAT (SEQ ID NO:395);
GGGTCAGATA (SEQ ID NO:396); GGTCAGATAT (SEQ ID NO:397);
GTCAGATATC (SEQ ID NO:398); TCAGATATCC (SEQ ID NO:399);
CAGATATCCA (SEQ ID NO:400); AGATATCCAC (SEQ ID NO:401);
GATATCCACT (SEQ ID NO:402); ATATCCACTG (SEQ ID NO:403);
TATCCACTGA (SEQ ID NO:404); ATCCACTGAC (SEQ ID NO:405);
TCCACTGACC (SEQ ID NO:406); CCACTGACCT (SEQ ID NO:407);
CACTGACCTT (SEQ ID NO:408); ACTGACCTTT (SEQ ID NO:409);
CTGACCTTTG (SEQ ID NO:410); TGACCTTTGG (SEQ ID NO:411);
GACCTTTGGA (SEQ ID NO:412); ACCTTTGGAT (SEQ ID NO:413);
CCTTTGGATG (SEQ ID NO:414); CTTTGGATGG (SEQ ID NO:415);
TTTGGATGGT (SEQ ID NO:416); TTGGATGGTG (SEQ ID NO:417);
TGGATGGTGC (SEQ ID NO:418); GGATGGTGCT (SEQ ID NO:419);
GATGGTGCTA (SEQ ID NO:420); ATGGTGCTAC (SEQ ID NO:421);
TGGTGCTACA (SEQ ID NO:422); GGTGCTACAA (SEQ ID NO:423);
GTGCTACAAG (SEQ ID NO:424); TGCTACAAGC (SEQ ID NO:425);
GCTACAAGCT (SEQ ID NO:426); CTACAAGCTA (SEQ ID NO:427);
TACAAGCTAG (SEQ ID NO:428); ACAAGCTAGT (SEQ ID NO:429);
CAAGCTAGTA (SEQ ID NO:430); AAGCTAGTAC (SEQ ID NO:431);
AGCTAGTACC (SEQ ID NO:432); GCTAGTACCA (SEQ ID NO:433);
CTAGTACCAG (SEQ ID NO:434); TAGTACCAGT (SEQ ID NO:435);
AGTACCAGTT (SEQ ID NO:436); GTACCAGTTG (SEQ ID NO:437);
TACCAGTTGA (SEQ ID NO:438); ACCAGTTGAG (SEQ ID NO:439);
CCAGTTGAGC (SEQ ID NO:440); CAGTTGAGCC (SEQ ID NO:441);
AGTTGAGCCA (SEQ ID NO:442); GTTGAGCCAG (SEQ ID NO:443);
TTGAGCCAGA (SEQ ID NO:444); TGAGCCAGAT (SEQ ID NO:445);
GAGCCAGATA (SEQ ID NO:446); AGCCAGATAA (SEQ ID NO:447);
GCCAGATAAG (SEQ ID NO:448); CCAGATAAGG (SEQ ID NO:449);
CAGATAAGGT (SEQ ID NO:450); AGATAAGGTA (SEQ ID NO:451);



GATAAGGTAG (SEQ ID NO:452);
TAAGGTAGAA (SEQ ID NO:454);
AGGTAGAAGA (SEQ ID NO:456);
GTAGAAGAGG (SEQ ID NO:458);
AGAAGAGGCC (SEQ ID NO:460);
AAGAGGCCAA (SEQ ID NO:462);
GAGGCCAATA (SEQ ID NO:464);
GGCCAATAAA (SEQ ID NO:466);
CCAATAAAGG (SEQ ID NO:468);
AATAAAGGAG (SEQ ID NO:470);
TAAAGGAGAG (SEQ ID NO:472);
AAGGAGAGAA (SEQ ID NO:474);
GGAGAGAACA (SEQ ID NO:476);
AGAGAACACC (SEQ ID NO:478);
AGAACACCAG (SEQ ID NO:480);
AACACCAGCT (SEQ ID NO:482);
CACCAGCTTG (SEQ ID NO:484);
CCAGCTTGTT (SEQ ID NO:486);
AGCTTGTTAC (SEQ ID NO:488);
CTTGTTACAC (SEQ ID NO:490);
TGTTACACCC (SEQ ID NO:492);
TTACACCCTG (SEQ ID NO:494);
ACACCCTGTG (SEQ ID NO:496);
ACCCTGTGAG (SEQ ID NO:498);
CCTGTGAGCC (SEQ ID NO:500);
TGTGAGCCTG (SEQ ID NO:502);
TGAGCCTGCA (SEQ ID NO:504);
AGCCTGCATG (SEQ ID NO:506);
CCTGCATGGA (SEQ ID NO:508);
TGCATGGAAT (SEQ ID NO:510);
CATGGAATGG (SEQ ID NO:512);
TGGAATGGAT (SEQ ID NO:514);
GAATGGATGA (SEQ ID NO:516);
ATGGATGACC (SEQ ID NO:518);
GGATGACCCT (SEQ ID NO:520);
ATGACCCTGA (SEQ ID NO:522);
GACCCTGAGA (SEQ ID NO:524);
CCCTGAGAGA (SEQ ID NO:526);




AtAAadTAQA (SBQ ID NO: 453) ; 
AMSRAGMUI (SBQ ID NOt455) ; 

omatkMoika («bq id 110:457) ; 

TAmUhGAOOC (SBQ ID MO: 459) ; 
QIUiaAqgCCA <«iO XO ■0:4€1); 
AOMOeSClUkT MQ ZD MO: 463) ; 
JIOaOCAATAA (MMQ ID NO:465) ; 

ocxaaoMMa ima id ho : 467); 

GAATAAAGBA (MBQ ZD M0:4€9) ; 
ATAAAOOIUSA (MBQ ZD H0i471) ; 
AAAOaAaAOA (MBQ ZD MOx473) ; 
ASMaibaiU^ <MBQ ZO N0t475) ; 
OMMOlUkGAC (mo 10 M0<477) ; 
OMHUkCACeA («BQ ZD N0t479) ; 

omcKcciMac (mo m M0t49i) ; 

AOmeCAOCTT (SBQ ID MOi4») ; 
AOakOCROT (SMQ ZD ll0t4«S) ; 
OM S r VaVO L (mo to HOi 4G7) ; 
OerVOTTACA (SMQ ZD MOt4»9) ; 
TTOrnCAOC <MQ ZD IK>t491) ; 
OTfACAOCSCT <MQ X» ■0»493) ; 
TMAOOevOT (SaO Z& MOt49S) ; 
CAQOCTOiaA («BQ ID M0s497) ; 
OOeiOraASC (SBQ id M0:499) ; 
CTOraAOCCT (SBQ ID M0:501); 
CrrOMIOCTaC (SMQ id M0tS03) ; 
OAQCCTOCAT (SBQ ID NO: 505) ; 
OCCXOCATOO (SBQ ZD 110:507); 
CraGATOGAA (SBQ ID MOtS09) ; 
aCMKXUUlTO <«BQ ID MO: 511) ; 

AxaaMxaoA (m xb Ne>:si3); 

(XSAKVQGUITO (iBQ ID MO: 515); 

AMNttATOAC (flQ ID 110:517); 

TOCykTGACCC (SIQ ID MO:519) ; 

aKTQhCCCtG (SBQ ID M0:521); 

TQACCCTGAO (SBQ ID M0:B23) ; 

ACCXnOAOAGI (SBQ ID M0:52S); 

CCTOAGAGAO (SBQ ID NO: 527) ; 



SENT BY:Davles ColUson Cave : 6- 6-35 ; 18:39 
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CTaAGAGAOA (SEQ ID MO: 528) ; 
GAGAGAGAAG (SEQ ID NO:530);
GAGAGAAGTG (SEQ ID NO:532);
GAGAAGTGTT (SEQ ID NO:534);
GAAGTGTTAG (SEQ ID NO:536);
AGTGTTAGAG (SEQ ID NO:538);
TGTTAGAGTG (SEQ ID NO:540);
TTAGAGTGGA (SEQ ID NO:542);
AGAGTGGAGG (SEQ ID NO:544);
AGTGGAGGTT (SEQ ID NO:546);
TGGAGGTTTG (SEQ ID NO:548);
GAGGTTTGAC (SEQ ID NO:550);
GGTTTGACAG (SEQ ID NO:552);
TTTGACAGCC (SEQ ID NO:554);
TGACAGCCGC (SEQ ID NO:556);
ACAGCCGCCT (SEQ ID NO:558);
AGCCGCCTAG (SEQ ID NO:560);
CCGCCTAGCA (SEQ ID NO:562);
GCCTAGCATT (SEQ ID NO:564);
CTAGCATTTC (SEQ ID NO:566);
AGCATTTCAT (SEQ ID NO:568);
CATTTCATCA (SEQ ID NO:570);
TTTCATCACG (SEQ ID NO:572);
TCATCACGTG (SEQ ID NO:574);
ATCACGTGGC (SEQ ID NO:576);
CACGTGGCCC (SEQ ID NO:578);
CGTGGCCCGA (SEQ ID NO:580);
TGGCCCGAGA (SEQ ID NO:582);
GCCCGAGAGC (SEQ ID NO:584);
CCGAGAGCTG (SEQ ID NO:586);
GAGAGCTGCA (SEQ ID NO:588);
GAGCTGCATC (SEQ ID NO:590);
GCTGCATCCG (SEQ ID NO:592);
TGCATCCGGA (SEQ ID NO:594);
CATCCGGAGT (SEQ ID NO:596);
TCCGGAGTAC (SEQ ID NO:598);
CGGAGTACTT (SEQ ID NO:600);
GAGTACTTCA (SEQ ID NO:602);
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TGAGAGAGAA (SEQ ID NO:529);
AGAGAGAAGT (SEQ ID NO:531);
AGAGAAGTGT (SEQ ID NO:533);
AGAAGTGTTA (SEQ ID NO:535);
AAGTGTTAGA (SEQ ID NO:537);
GTGTTAGAGT (SEQ ID NO:539);
GTTAGAGTGG (SEQ ID NO:541);
TAGAGTGGAG (SEQ ID NO:543);
GAGTGGAGGT (SEQ ID NO:545);
GTGGAGGTTT (SEQ ID NO:547);
GGAGGTTTGA (SEQ ID NO:549);
AGGTTTGACA (SEQ ID NO:551);
GTTTGACAGC (SEQ ID NO:553);
TTGACAGCCG (SEQ ID NO:555);
GACAGCCGCC (SEQ ID NO:557);
CAGCCGCCTA (SEQ ID NO:559);
GCCGCCTAGC (SEQ ID NO:561);
CGCCTAGCAT (SEQ ID NO:563);
CCTAGCATTT (SEQ ID NO:565);
TAGCATTTCA (SEQ ID NO:567);
GCATTTCATC (SEQ ID NO:569);
ATTTCATCAC (SEQ ID NO:571);
TTCATCACGT (SEQ ID NO:573);
CATCACGTGG (SEQ ID NO:575);
TCACGTGGCC (SEQ ID NO:577);
ACGTGGCCCG (SEQ ID NO:579);
GTGGCCCGAG (SEQ ID NO:581);
GGCCCGAGAG (SEQ ID NO:583);
CCCGAGAGCT (SEQ ID NO:585);
CGAGAGCTGC (SEQ ID NO:587);
AGAGCTGCAT (SEQ ID NO:589);
AGCTGCATCC (SEQ ID NO:591);
CTGCATCCGG (SEQ ID NO:593);
GCATCCGGAG (SEQ ID NO:595);
ATCCGGAGTA (SEQ ID NO:597);
CCGGAGTACT (SEQ ID NO:599);
GGAGTACTTC (SEQ ID NO:601);
AGTACTTCAA (SEQ ID NO:603);






GTACTTCAAG (SEQ ID NO:604);
TACTTCAAGA (SEQ ID NO:605);
ACTTCAAGAA (SEQ ID NO:606);
CTTCAAGAAC (SEQ ID NO:607);
TTCAAGAACT (SEQ ID NO:608);
TCAAGAACTG (SEQ ID NO:609);
CAAGAACTGC (SEQ ID NO:610);
AAGAACTGCT (SEQ ID NO:611);
AGAACTGCTG (SEQ ID NO:612);
GAACTGCTGA (SEQ ID NO:613).
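
The decanucleotides listed above appear to be consecutive 10-nucleotide windows over a single stretch of sequence, each shifted by one nucleotide from the one before and carrying consecutive SEQ ID NOs. A minimal Python sketch of that enumeration follows. The function name `decanucleotides` and its parameters are illustrative assumptions and are not part of the application.

```python
from typing import Dict


def decanucleotides(sequence: str, first_seq_id: int, k: int = 10) -> Dict[int, str]:
    """Map consecutive identifiers to the k-nucleotide windows of `sequence`.

    Hypothetical helper: window 0 receives `first_seq_id`, window 1 the next
    identifier, and so on, one nucleotide further along each time.
    """
    sequence = sequence.upper()
    return {
        first_seq_id + offset: sequence[offset:offset + k]
        for offset in range(len(sequence) - k + 1)
    }


# The 19 nucleotides spanned by SEQ ID NOs 604 to 613 in the list above.
windows = decanucleotides("GTACTTCAAGAACTGCTGA", first_seq_id=604)
for seq_id, window in windows.items():
    print(f"{window} (SEQ ID NO:{seq_id})")
```

A sequence of length n yields n - 9 decanucleotides, which is why a short region produces a long list.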



101. A method according to claim 94 wherein said deletion encompasses at least one
of the following decanucleotides from the nef gene of HIV-1 NL4-3:





ACCAGCTTGT [SEQ ID NO:803];
CCAGCTTGTT [SEQ ID NO:804];
CAGCTTGTTA [SEQ ID NO:805];
AGCTTGTTAC [SEQ ID NO:806];
GCTTGTTACA [SEQ ID NO:807];
CTTGTTACAC [SEQ ID NO:808];
TTGTTACACC [SEQ ID NO:809];
TGTTACACCC [SEQ ID NO:810];
GTTACACCCT [SEQ ID NO:811];
TTACACCCTG [SEQ ID NO:812];
TACACCCTGT [SEQ ID NO:813];
ACACCCTGTG [SEQ ID NO:814];
CACCCTGTGA [SEQ ID NO:815];
ACCCTGTGAG [SEQ ID NO:816];
CCCTGTGAGC [SEQ ID NO:817];
CCTGTGAGCC [SEQ ID NO:818];
CTGTGAGCCT [SEQ ID NO:819];
TGTGAGCCTG [SEQ ID NO:820];
GTGAGCCTGC [SEQ ID NO:821];
TGAGCCTGCA [SEQ ID NO:822];
GAGCCTGCAT [SEQ ID NO:823];
AGCCTGCATG [SEQ ID NO:824];
GCCTGCATGG [SEQ ID NO:825];
CCTGCATGGA [SEQ ID NO:826];
CTGCATGGAA [SEQ ID NO:827];
TGCATGGAAT [SEQ ID NO:828];
GCATGGAATG [SEQ ID NO:829];
CATGGAATGG [SEQ ID NO:830];
ATGGAATGGA [SEQ ID NO:831];
TGGAATGGAT [SEQ ID NO:832];
GGAATGGATG [SEQ ID NO:833];
GAATGGATGA [SEQ ID NO:834];
AATGGATGAC [SEQ ID NO:835];
ATGGATGACC [SEQ ID NO:836];
TGGATGACCC [SEQ ID NO:837];
GGATGACCCT [SEQ ID NO:838];
GATGACCCTG [SEQ ID NO:839];
ATGACCCTGA [SEQ ID NO:840];
TGACCCTGAG [SEQ ID NO:841].











102. A method according to claim 94 wherein said deletion encompasses at least one
of the following decanucleotides from the LTR region of HIV-1 NL4-3:

GCTTTTTGCC (SEQ ID NO:652);
CTTTTTGCCT (SEQ ID NO:653);
TTTTTGCCTG (SEQ ID NO:654);
TTTTGCCTGT (SEQ ID NO:655);
TTTGCCTGTA (SEQ ID NO:656);
TTGCCTGTAC (SEQ ID NO:657);
TGCCTGTACT (SEQ ID NO:658);
GCCTGTACTG (SEQ ID NO:659);
CCTGTACTGG (SEQ ID NO:660);
CTGTACTGGG (SEQ ID NO:661);
TGTACTGGGT (SEQ ID NO:662);
GTACTGGGTC (SEQ ID NO:663);








TACTGGGTCT (SEQ ID NO:664);
ACTGGGTCTC (SEQ ID NO:665);
CTGGGTCTCT (SEQ ID NO:666);
TGGGTCTCTC (SEQ ID NO:667);
GGGTCTCTCT (SEQ ID NO:668);
GGTCTCTCTG (SEQ ID NO:669);
GTCTCTCTGG (SEQ ID NO:670);
TCTCTCTGGT (SEQ ID NO:671);
CTCTCTGGTT (SEQ ID NO:672);
TCTCTGGTTA (SEQ ID NO:673);
CTCTGGTTAG (SEQ ID NO:674);
TCTGGTTAGA (SEQ ID NO:675);
CTGGTTAGAC (SEQ ID NO:676);
TGGTTAGACC (SEQ ID NO:677);
GGTTAGACCA (SEQ ID NO:678);
GTTAGACCAG (SEQ ID NO:679);
TTAGACCAGA (SEQ ID NO:680);
TAGACCAGAT (SEQ ID NO:681);
AGACCAGATC (SEQ ID NO:682);
GACCAGATCT (SEQ ID NO:683);
ACCAGATCTG (SEQ ID NO:684);
CCAGATCTGA (SEQ ID NO:685);
CAGATCTGAG (SEQ ID NO:686);
AGATCTGAGC (SEQ ID NO:687);
GATCTGAGCC (SEQ ID NO:688);
ATCTGAGCCT (SEQ ID NO:689);
TCTGAGCCTG (SEQ ID NO:690);
CTGAGCCTGG (SEQ ID NO:691);
TGAGCCTGGG (SEQ ID NO:692);
GAGCCTGGGA (SEQ ID NO:693);
AGCCTGGGAG (SEQ ID NO:694);
GCCTGGGAGC (SEQ ID NO:695);
CCTGGGAGCT (SEQ ID NO:696);
CTGGGAGCTC (SEQ ID NO:697);
TGGGAGCTCT (SEQ ID NO:698);
GGGAGCTCTC (SEQ ID NO:699);
GGAGCTCTCT (SEQ ID NO:700);
GAGCTCTCTG (SEQ ID NO:701);
AGCTCTCTGG (SEQ ID NO:702);
GCTCTCTGGC (SEQ ID NO:703);
CTCTCTGGCT (SEQ ID NO:704);
TCTCTGGCTA (SEQ ID NO:705);
CTCTGGCTAA (SEQ ID NO:706);
TCTGGCTAAC (SEQ ID NO:707);
CTGGCTAACT (SEQ ID NO:708);
TGGCTAACTA (SEQ ID NO:709);
GGCTAACTAG (SEQ ID NO:710);
GCTAACTAGG (SEQ ID NO:711);
CTAACTAGGG (SEQ ID NO:712);
TAACTAGGGA (SEQ ID NO:713);
AACTAGGGAA (SEQ ID NO:714);
ACTAGGGAAC (SEQ ID NO:715);
CTAGGGAACC (SEQ ID NO:716);
TAGGGAACCC (SEQ ID NO:717);
AGGGAACCCA (SEQ ID NO:718);
GGGAACCCAC (SEQ ID NO:719);
GGAACCCACT (SEQ ID NO:720);
GAACCCACTG (SEQ ID NO:721);
AACCCACTGC (SEQ ID NO:722);
ACCCACTGCT (SEQ ID NO:723);
CCCACTGCTT (SEQ ID NO:724);
CCACTGCTTA (SEQ ID NO:725);
CACTGCTTAA (SEQ ID NO:726);
ACTGCTTAAG (SEQ ID NO:727);
CTGCTTAAGC (SEQ ID NO:728);
TGCTTAAGCC (SEQ ID NO:729);
GCTTAAGCCT (SEQ ID NO:730);
CTTAAGCCTC (SEQ ID NO:731);
TTAAGCCTCA (SEQ ID NO:732);
TAAGCCTCAA (SEQ ID NO:733);
AAGCCTCAAT (SEQ ID NO:734);
AGCCTCAATA (SEQ ID NO:735);
GCCTCAATAA (SEQ ID NO:736);
CCTCAATAAA (SEQ ID NO:737);
CTCAATAAAG (SEQ ID NO:738);
TCAATAAAGC (SEQ ID NO:739);



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CAXTAAAOCT 

AXMAOCTTG 

AAAGCTTGCC 

AOCTTaCCTT 

CTTOCCTTUA 

TaOCTTQAOT 

CCTTOAOTaC 

TTGA£3TaCTT 

GmOTGCTTCA 

OTGCTTCIUIO 

6CTTCSMOTA 

TTGAAOTAST 

GaUWOTAOTaT 

TAOZOTOTOC 

OTGTGXOCCC 

CSTGTCIGCCX3T 

CnDGCCCGTCT 

GCXXXSTCnOT 

COQTCTGTTO 

OTCZGTTGrrO 

CTGTTCnXSTO 

aTT9T9TGAC 

TOTOTaACTC 

TSTGrrOXCTC 

TOTOACTCTG 

TXaACTCTGGT 

ACTCTaOTAA 

TCTGGTAACr 

TOOTAACTAO 



(SBQ ID HO s 740) ; 
(SBQ ZD IIOt743) ; 
(SBQ ZD irO:744) ; 
(SBQ ZD llOt74C) ; 
(SBQ ZD 110:748) ; 
(SBQ ZD NO t 750) t 
(SBQ ZD 80:752) t 
(SBQ ZD ISOi7f4) ; 
(SBQ ZD N0;7S€) ; 
(SBQ ZD I10:7S8) / 
(SBQ ZD NO t 7^0) ; 
(SBQ ZD NOt7C2) ; 
(SBQ ZD NOs7€4} ; 
(SBQ ZD NO:7CC) / 
(SBQ ZD M0:7Ca); 
(SBQ ZD NOi770) | 
(SBQ ZD MO? 772); 
(SBQ ZD NOs774) i 
(SBQ ZD N0i77«)| 
(SBQ ID NOi77t); 
(SBQ ZD MO I 780) ; 
(SBQ ZD NO t 782) ; 
(SBQ ZD MOt784>f 
(SBQ ZD NO:78S) ; 
(SBQ ZD MO: 788) ; 
(SBQ ZD NO: 790) ; 
(SBQ ZD N0t7»2) ; 
(SBQ ZD NO: 794) ; 
(SBQ ZD NO: 796) ; 
(SBQ ZD N0{798) i 



AXTAMOCTT 

Tjuuwacnoc 



Qcmccma 



Aarararaoc 



vwttMxscarrc 

TaCOOOfC C tQ 



oarsamton 

TlVtHiiUJiCT 

cnwrajicrcT 

GSVQMACTCT 
GTOUCTCTOO 

CTCTOQTAAC 
CiaorTAACTA 
OOTAACTAGA 



(SBQ ID HO: 741) ; 
(SBQ TO NO: 743) ; 
(BBQ ZD HOt74S} ; 
(MQ ZD BO: 747} ; 
(BBQ ZD HDt749) ; 
(SBQ ZD HD<75a) ; 
(SBQ ZD MO: 753) ; 
(SBQ ZD HO:755) ; 
(MQ ZD MO:757) ; 
(SBQ ZD ■0x759) ; 
(SBQ ZD H0:7€l) ; 
(HBQ ZD NO: 763) ; 
(BBQ ZD WOtl€S) ; 
CSBQ ZD HO: 767} ; 
(OBQ ZD MO: 769) ; 
(SBQ ZD NO:771) ; 
(SBQ T» NOi773) / 
(SBQ ZD MO:77S) i 
(BBQ XD HDt777) / 
MQ Xb II0I779); 
(SBQ Bl MD>78X) / 
(SBQ ZD 110:783) ; 
(SBQ ZD MO: 785) ; 
(SBQ ZD HO 1 787) ; 
(SBQ ZD NO: 789) i 
(SBQ ZD M0s79l) ; 
(mQ ZD MOi793) } 
(SBQ ZD MO: 795) ; 
(SBQ ZD NO: 797} ; 
(SBQ ZD MO: 799) . 



103. A method for determining the pathogenicity of an HIV-1 strain after said HIV-1
strain infects cells of an individual, said method comprising determining the presence
of a deletion mutation in its genome of at least 10 nucleotides within the region from
nucleotide 8787 to nucleotide 9709 using the nucleotide numbering of HIV-1 NL4-3,

wherein the presence of said deletion mutation is indicative of the presence of a
non-pathogenic strain of HIV-1.
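
The determination recited in claim 103 can be illustrated with a short Python sketch. It assumes, hypothetically, that deletions have already been identified by comparing the isolate with the reference sequence and are supplied as inclusive (start, end) coordinates in the reference numbering. The names `REGION`, `MIN_DELETION`, `deleted_within` and `has_qualifying_deletion` are illustrative and do not come from the application. The sketch also adopts one possible reading of the claim: only deleted nucleotides that fall inside the region from nucleotide 8787 to nucleotide 9709 count toward the 10-nucleotide minimum.

```python
from typing import Iterable, Tuple

Interval = Tuple[int, int]  # inclusive (start, end) in reference numbering

REGION: Interval = (8787, 9709)  # bounds stated in claim 103
MIN_DELETION = 10                # minimum deletion length stated in claim 103


def deleted_within(deletion: Interval, region: Interval = REGION) -> int:
    """Count the deleted nucleotides that lie inside `region`."""
    start = max(deletion[0], region[0])
    end = min(deletion[1], region[1])
    return max(0, end - start + 1)


def has_qualifying_deletion(
    deletions: Iterable[Interval],
    region: Interval = REGION,
    min_len: int = MIN_DELETION,
) -> bool:
    """Return True if any single deletion removes at least `min_len`
    nucleotides from `region` (assumed reading of the claim)."""
    return any(deleted_within(d, region) >= min_len for d in deletions)


# Hypothetical deletion calls for one isolate, for illustration only.
example_deletions = [(8830, 8845), (9500, 9504)]
print(has_qualifying_deletion(example_deletions))  # True: 16 nt deleted at 8830-8845
```

How the deletions are detected in the first place (for example by sequencing and alignment) is outside this sketch.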
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104. A method according to claim 99 wherein said non-pathogenic HIV-1 carries a
deletion in its genome of at least 10 nucleotides from within a region selected from the
list consisting of:

nucleotide (i) S830-ttti2;

(ii) 9009-9035;
(iii) 9019-9029; and

(iv) 9033-9049.



105. A method according to claim 99 wherein said non-pathogenic HIV-1 carries a
deletion in its genome of at least 10 nucleotides from within a region selected from the
list consisting of:

nucleotide (v) 9211^71;

(vi) 92tl^9362;

(vii) 910S-9224; and
(viii) 9271-9370.



106. A method according to claim 99 wherein said non-pathogenic HIV-1 carries a
deletion in its genome of at least 10 nucleotides from within a region selected from the
list consisting of:

nucleotide (ix) 8m-492t;

(x) 8850-9006;
(xi) 8792-9041; and
(xii) 9112-9204.



107. A method according to claim 99 wherein said non-pathogenic HIV-1 carries a
deletion in its genome of at least 10 nucleotides from within a region selected from the
list consisting of:

nucleotide (xiii) 910S-9224;

(xiv) 9319-9395; and

(xv) 92S1-9366.
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108. A method according to claim 99 wherein said deletion encompasses at least one
of the following decanucleotides from the nef gene of HIV-1 NL4-3:



ATGGGXX36CA (SBQ ZD NO: 2) ; 


TGMMnraocxx 


<«»Q 


ZD m 


:>:3) ; 




/ Tff% - ^ % . 


GKnpoQouuyr 




lO w 


>tS) 1 




(SBQ ID bKjzH} $ 




(W 


\ ZD SO:7); 




(SBQ XO mOs8) / 


GCaUMRaOTC 


(MO 


ZD »0:9); 


CAA0TGKITCA 


(SBQ ID wOiXO); 




(•■Q 


ZD 


MO: 11) ; 




(SBQ XD mOsU) ; 


cnoarcauAA 


(MQ 


n> 


■0:13) ; 




(SBQ ZD JHO:14) ; 


<KII€MUUUkO 


(MQ 


ZD 


NO: 15) / 




(SBQ ID NO: 16) ; 


TOUVAAAOTA 


(«BQ 


XD 


MO>17) ; 




(SBQ ID NOtlSli 


AJMAAOIiAaT 


(SBQ 


ID 


MO: 19) ; 




(SBQ ZD llOt20) ; 


AAAAnunOT 


(MQ 


ZD 


MO:21) ; 




I9BQ ZU JwOt22); 


AdnBAOVOTOA 


(SBQ 


ZD 


MO<23) ; 




(SBU ZD ilOS24/ 1 


TASnPHTQATT 


(SBQ 


ZD 


MO:2S) t 




l9lSQ XU jBIU33o|| 


CMRIATTOQ 


(«BQ 


XD 


Bflr% _ ^ •« V 

■0x27} ; 




(SISW ZD JNUI2V) / 




(BBQ 


XD 


MW • mW i f 




(SBQ XD WKftSQi $ 


OArraGATaa 


(BIQ 


ZD 


aO:31) 1 


A1*JN3CSM?0QC 


(SBQ ZD llOi32) 1 


TTCQMliaOCSC 


(BBQ 


ZD 


■0:33) / 


TOQATOQCSCT 


(SBQ ZD llOi34} ; 


(NHSXIOOCrai 




ZD 


■0:35) ; 


OATGaOCTOC 


(SBQ XD M0:)() ; 






ID 


■0:37); 


TaoccTOCTa 


(SBQ ZD VOsSB) t 


QHMK5WC1UT 


(MBQ 


ZD 


■0x39) ; 


OCCTOCTOTA 


(SBQ ZD M0t40); 


ocracraTAA 


(BBQ 


ZD 


■0:41) ; 


CTOCTOTAAa 


(SBQ ZD MO:42) ; 




(BBQ 


ZD 


■0:43) ; 


ocTGrrAAoao 


(SBQ ZD 110:44) ; 


CmAAOOOA 


(•■Q 


ZD 


■0:45) ; 


TOTAaOOGAA 


(SEO ZD N0i46) ; 


afnukaooAAA 


(SBQ 


ID 


■0:47) ; 




issy ID liOt4vi ; 




(BBQ 


ID 


■0:49) ; 


AaOOAAAOAA. 


(SBQ XD NO:50> ; 


CKMAAAOAAT 


(SBQ 


ID 


■0:51) ; 


OCUUUUSAATO 


(SBQ ZD liOxS2) ; 


OAAAOAATOA 


(SBQ 


ID 


■0:53) ; 




(SBQ ZD N0:54) ; 


AAOAATOAGA 


(«BQ 


ZD 


H0:S5) ; 


AOAATOAOAC 


(SBQ ID N0:5€); 


aAXTOAaAoa 


(CBQ 


ZD 


110:57); 


AATQAiQAOQA 


(SBQ ZD 110:54); 


ASKAOAOaAa 


(SBQ 


ID 


110:59); 


TGAGACXSAOC 


(SBQ ZD NO: 60); 


OMMSQAaCT 


(8BQ 


ZD 


NO: 61); 


AOAOOAOCTO 


(SBQ ID NOt^a) ; 




(SBQ 


ZD 


N0:63) ; 


AOGAfiCTGXG 


(SEQ ZD 110:64) ; 


OQMICTaAOC 


(SBQ 


ID 


MO: 65) ; 


OAOCTQAOCC 


(SEQ ID NO: 66) ; 


AOCTQAQCCA 


(SBQ 


ID 


NO: 67) ; 


GCTQAGCCAO 


(SEQ ID N0:68) ; 


CTOAOCCAGC 


(SEQ 


ID 


NO:69) ; 
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TGAOCCftGCA (SBQ ID MO:70) i 
A0CX:MKLMX: (SBQ id N0:72); 
COUaCAOCAa (SEQ id N0:74); 
AGCAiOCftaA.T (SBQ ID 110i7€); 
dkOCMhTOa (SBQ ID MO:76) ; 
GCMATOOOO (SEQ ID MOsSO) ; 
AOATOOGKITO (6SQ ID M0:S2); 

KToaacnaaa (sbq id mo:84); 
aaaaraaaAa (shq id no:8«); 
GOTaaaAacA. (sbq id no««8); 

TOOaAaCAOT (SBQ id NO: 90); 
aaHQCAaTAT (sbq id MOt92); 
AGCAOTATCT (SBQ ID liO:»4>; 
CAOTATCTGO (SBQ ID NO:»6); 
<3TATCTCX»0 (SBQ ID M0t»6) ; 
ATCTOGASAC (SBQ ID NO: 100); 
CTOGAGACCT (SBQ ID NOtl02) ; 
OaAOACCTAO (SBQ ID 110x104); 
AOACCTAGAA (SEQ ID NO:106) | 
AOCTAOAAAA (SBQ ID 110:100) f 
CIASAAAAAC (SBQ ID HO: 110) i 
AOAAAAACAT (SEQ ID HO: 112); 
AAAAACATOO (SBQ ID NO: 114); 
AAACAT60AO (SBQ ID NO: 116); 
AiCATOGAGGA (SEQ ID NO: 110); 
ATCKAOCAAT (SEQ ID NO: 120); 
OOAOCAATCA (SEQ ID NO: 122); 
AOCAATCACA (SBQ ID NO: 124); 
CAATCACAAO (SBQ ID NO: 126); 
ATCACAAOTA (SBQ ID NO: 128); 
CACJUiGTAOC (SEQ ID NO: 130) ; 
CAAGTAGCAA (SEQ ID NO: 132) j 
AOTAGCAATA (SSQ ID HO: 134); 
TAGCAATACA (SBQ ID NO: 136); 
OCAATACAGC (SEQ ID NO: 138) ; 
AATACAGCAG (SEQ ID NO: 140); 
TACAQCAQCT (SEQ ID NO: 142); 
CAOCAOCTAA (SBQ ID NO: 144); 





(M 


IQ ZD 


HO:71) ; 




(SB 


Q ID 


N0:73) ; 




im 


Q ZD 


MD:75) ; 


ocaflcamATO 


(mt 


Q ID 


MO»77) ; 


AocMamaa 


im 


Q ZD 


MOx79) ; 


GMHUrOQOQT 


(am 


Q ID 


HDt81) ; 


OATOWMKITCKy 


ma ID 


M0:03) ; 


laOOOTOGGA 


(m 


a XD 


NO: 85) ; 


(MDNKKHkOC 


(m 


3 XO 


MO:87) ; 




(«K 


} XD 


MO:89) ; 


aaaii0CitaTA 


(9m 


3 ZD 


HD:91); 




(am 


2 ZD 


N0:93) ; 


(ICiUirATlJTL! 


iaMQ ID 


N0t95) ; 


AflTCATCTOOA 


(SBQ ID 


N0:97) ; 


TMNnOQAOA 


(ABC 


) XD 


BD:99) ; 


TCrOGBkOACC 


(MO 


n> 1 


IDtlOl) ; 


TCGMOAOCTA 


<«BQ 


ZD MOtl03) / 


OAOAOCTAOA 




ZD BD: 


105) ; 


OAQCEAaAAA 


cm 


ZD BOi 


107) ; 


CXTCMAAAAA 


(MQ 


ZD BDi 


109) ; 


TAOAAAAACA 


(OQ 


ZD BDi 


111) ; 




(MQ 


ZD BO: 


113); 


AAAACAT9SA 


(no 


ID NO: 


115) i 


AACATGOAaC 


(ma 


ZD BO: 


117) ; 


GATOOAOCAA 


(SBQ 


ID HO: 


119) ; 


TOOAOCAATC 


(«IQ 


ZD MO) 


121); 


GUbOCAATCiAC 


(MBQ 


ID NO: 


123) ; 


OCAATCACIIA 


<s^ 


ZD NOi 


125) ; 




(SBQ 


ZD NOi 


.127) ; 






XD MOi 


;129); 




(MQ 


XO K>i 


1131); 


AAtfZAuCAAT 


(m 


XO 00; 


rl33) ; 


onyocAXTAC 


(HQ 


XD UO i 


(135) ; 


AOCaUVTACAia 


(8IQ 


XO HOi 


;137) ; 




(SBQ 


XD MO 


;139); 


ATACaUSCAQC 


(«BQ 


ID MO 


:141); 


ACAOCAGCTA 


(SBQ 


ZD NO 


:143); 


ACCAGCTAAC 


(8EQ 


ZD NO 


:145) ; 



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GCAfiCTAACH (SBQ ZD HOtl4S)} 

MCtMOAT (SBQ XD llO}148) ; 

CTAACakATOC (SBQ ZD NOtlSO) ; 

AACAATOCTO (SEQ ZD 110:152); 

CAATOCTOCT (SBQ ZD MO: 154) ; 

ATaCTOCTTQ (SEQ ZD liO:15£) ; 

aCTOCTTGtTO (SBQ ZD llOtl58) ; 

TOCTTOTOCC (SBQ ZD lfOtl«0); 

CTTGrrOCCTO (SBQ ZD 1K:>:1£2); 
TGTOCXrrOOC (SBQ ZO N0:164}; 
TOCCTGOCTA (SBQ ZD NO: ICC); 
CCXOGCTAGA (SBQ ZD MO: ICS) ; 
TG6CTAGAAG (SBQ ZD MO: 170) ; 
GCTAGAAGCA (SBQ ZD 170:173); 
TAOAAOCACA (SBQ ZD MO: 174); 
OAAOCACAAO (SBQ ZD MO:17C); 
AOGACAAOAa (SBQ ZD MOil7t) ; 
CACAAGAGOA (SEQ ID NO:1SO) ; 
CAAOAaOAlSO (SEQ ZD MO:182) ; 
AOAOGSAGGAA (SBQ ZD M0tl84) ; 
AGGAOGXAOA (SBQ 10 MOiltC) f 
GAGaAAOAaO (SBQ ZO H0:1M); 

oaAAaAOGrro (sbq zd moii^o); 

AAOAaaTOaO (SEQ zd M0:1S2) f 
GAOCrrOOam (SEQ zd N0:194); 
OaTOaGTTTT (SBQ ZO N0:19C) ; 
TGGGTTTTCC (SBQ ZD MO:198) ; 
GGTTTTCCAG (SBQ XO NO:200) ; 
TTTTCCAarC (SBQ ID NO: 202) ; 
TTCCAOTCAC (SEQ ID NO: 204) ; 
C<SmCKCkC (SEQ XD NO: 206) ; 
AGTCJlCACCTr (SEQ ID NO: 208); 
TCACACCTCA (SEQ ID N0:210) ; 
ACACCTCAOO (SEQ ZO N0:212) ; 
ACCTCAOGTA (SEQ ID N0:214) ; 
CTCAG6TACC (SEQ ID N0:21€) i 
CAGQTACCTT (SEQ ID N0:218) ; 
GCTACCTTTA (SEQ ID N0:220) ; 
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CAOCTAACAA (SSQ ZD 110:147) ; 

OCXMkCAATO (MQ ID »0:149) ; 

TAACMiXOCT (MQ ZO MO: 151) ; 

ACAA3OCT0C (BIQ ZD lK>:15a) ; 

AASOCTQCTT (M 10 110:155); 

T Q CSOC Vl X r V imO ZO M0:157); 

CTOCnOXOC (ma m M0:159); 

OCTXGTOOCT («BQ ZD I»:1C1) ; 
TTOnaOCTOa CJMQ zd IK):1C3) ; 
(VraOCraOCT (MQ ZO M0:1C5) ; 
(iaC9Eia(ICT3ia («Q zd MD:1C7}; 
CmCTAOAA (flBQ ZB MO:lC9) ; 
MCBkOAAOC (MQ ZD H0:171) ; 
CTKOAAOCAC (iSQ ZD M0tl73) ; 
AOMOCACAA (MQ lO N0:175) ; 
AAOC3LCAAQA (MQ ZD 110:177); 

aanoiMKaa (mq zd MO1179); 

ACAAOMKiAa (MQ ZO 110:181) ; 

AAOMoaAaaA (mq zo MOtiaa) ; 

MMMXSUUI (MQ ZO VOtlSS) ; 
MMOAAOM (MQ ZO n0tl87) ; 
AMMMMOOr (MQ I» K>tl89) ; 
OAAOMIOTOa (ma XO 1K>:191) ; 
AOMKITaaOT (MQ ZO MO: 193) ; 
AOaraaaTTT (MQ zo 1I0:195) ; 
OnOOOTTTTC (MQ ZO MOtl97) ; 
OOCnrTTTCCA (SBQ ZO 190:199); 
(filAlC CAOT (MQ ZD HO:a01) ; 
TTTOCA0TCA (MQ ID NO:203); 
teCMTCkCk (MQ 10 110:205) ; 
amCkOiCC (MQ id NO:207) ; 
OKCkCkCCIC (MQ ID NO:209); 
dbClkOCTCaUS (MQ ID 110:211); 
(aUSCTCAOOr (MQ IO M0:213) ; 
CXrXCAGGTAC (SBQ ID NO:215) ; 
TCMOTACCt (SEQ ID NO: 217) ; 
AOOrACCTTT (SSQ ID HO:219) ; 
OTACCTTTAA (SEQ ID NO:221) ; 
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TACCTTTAAa (SEQ ID NO: 222); 

CdnAAOAC (SEQ ID 80:224); 

TTTAAlOACCA (SEQ ID Ii0s236) ; 

TAAOACCaVAT (SEQ ID NO: 228) ; 

AOAOCAATOA (SEQ ID IK>:230) ; 

ACCAATOACT (SEQ ID NO s 232) ; 

CAATGSACTTA (9BQ ID NO: 334) ; 

ATOACTTACai (SEQ ZD MOsa34^) ; 

GACTTACAAa (SEQ ID N0t33S) ; 
CTTACAAGGC (SEQ ID N0:240) ; 
TACAAGGCAC (SEQ ID MO:242) i 
CAAOGSCAGCT (SEQ ID NOt244) ; 
AOOCA^acraT (SBQ id I10:24<); 
OGAOCTOTAa (SBQ ID NOt24t) ; 
A6CTGTAGAT (SBQ ID NO: 250) ; 
CTSTAGATCT (SBQ ID NO: 252) ; 
OTAOATCTTA (SBQ ID NO: 254) ; 
AOATCTTAOC (SBQ ID NOt25C) | 
ATCTTAGCTA (SBQ ID NO: 258) ; 
CTTAOOCACT (SBQ ID N0i3«0) ; 
TA£!GCACTTT (SBQ XD I90t3C2) ; 
OCCACTTTTT (SEQ ZD 110:284) ; 
GACTTTTTAA (SBQ ZD NO: 288) ; 
CTTTTTAAAA (SEQ ZD HO: 288) ; 
TTTTAAAAOA (SEQ XD N0:270) ; 
TTAAAAOAAA (SEQ ZD NO: 272) ; 
AAAAOAAAAa (SBQ ZD N0:274) ; 
AAOAAAAOaO (SEQ XD NO: 278) ; 
OAAAAOGGgO (SBQ XD NO: 278) ; 
AAAGGGGGGH (SEQ ID NO:2S0) ; 

AaaaaaoACT (seq xo in>:283); 

(SGGCKSACTOa (SEQ ID NO: 284); 

CKKACTGGAA (SBQ ID NO: 288) ; 

OACTGOAAOa (SEQ ZD NO: 288) ; 

CTaOAAGQCiC (SEQ ID NO:290) ; 

a(2AAaaocTA (seq id no : 2 92); 

AAGGGCTAAT (SEQ ID N0:294) / 
G6GCTAATTC (SEQ ID NO: 296) ; 
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AOCTTTAAOA (SBQ ID HOt223) ; 

CmAAOACC (SBQ XD IK>:22S) ; 

Tt M OUCChK (MQ XO HO:227) ; 

AAOACCAAXO (MQ XO MO: 229) ; 

<lAOCaiATaAC (Wm XD M0t331) ; 

OOAKraACTT (ODQ ZD ND:233) ; 
AJUOACTIiAC (MQ n> 110:235) ; 
VOACTCACSKX (8nQ ZD MD:237) ; 
ACmCAAOO (ffBQ XD 110:239); 
TTACAA06CA (MQ XD NO: 241) ; 
ACM0CK»aC («BQ XO MO:243} ; 
AMOCASCXO (MQ » 101245) ; 

o acAflCTOiii twmo » 110:347) ; 

CaMGVOXMIA (mo XO 00:349) ; 
CMTOXauaAXC («BQ SB M0:2S1) 7 
xmsnOATCTT (MQ XO M0:253) ; 

moAxcTTAa (sao » no : 25s); 

OMeTTAASC <MQ » MO: 357) ; 
TCVTAOGCAC (OBQ XO K>i359) ; 

rtmooicm (mo xo aQt38i) ; 
MMKSAcmr iflOQ xo NOcssa) > 

flOlCTTTTTA (SBQ XO 00:385) ; 
ACTTTTTAAA (OBQ XO 110:387) ; 
TTTTTAAAAa (SBQ XO 110:369) ; 
TTTAAAAGAA (SBQ XO 110:371) ; 
TAAAAGAAAA (SBQ XO NO:373) ; 
AAAOAAAAOO (SBQ ID NO: 375) ; 
AflAAAAGWGO (SBQ ID MO:377) ; 
AAMOGUOOa (SBQ XO MO:379) ; 
AMOCXKXttC («BQ ID 1K>:281) ; 
OOfiOOGMCTa <ilQ ID !K>:283); 
OQQQACTGKSH (IIQ 10 110:285) ; 
OKIKCTKKSAAG (8BQ XD N0:287) ; 
ACTOGAAOOa (SBQ ID iaO:289) ; 
raaAAOOOC:T (SBQ id N0t291); 
OAAOGaCTAA (9BQ ID MO: 293) ; 
AOOGCTAATT (SBQ ID NO: 295) ; 
OGCTAATTCA (SEQ ID NO:297) ; 
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OCTAATTGAC {8BQ ID liOi298) i 
TAATTCHCTC (SEQ ZD 110:300) / 
ATTCACTCCC (SEQ ZD HOt302) / 
TCACTCCC3UI (SBQ ZD N0:304) ; 
ACTCOCAAA0 (SBQ ZD MOxSOS) | 
TOCCMUUSim (SBQ ZD M0;300) ; 
CCAAAGAiUSli (SBQ ZD HO:310) ; 
AXAGAAOACA (SBQ ZD 110:319) f 

AOMUSACIUUS (SBQ ZD 110:314); 

AAOJLCJUiQAT (SBQ ZD IID:31C) i 
(SBQ ZD Ii0t31t) j 

CAAOATATCC (SBQ ID HO i 330) ; 

AOATATCCTT (SEQ ID MO: 322) ; 

ATATCCTTOA (SBQ ZD HO: 334) ; 

ATCCTTQAirC (SBQ ZD 190;32<) ; 

CSCTTOATCrO (SEQ ZD BO:32i) r 

TTOATCTOTa (SBQ ZD NO«330) r 

GAFCTGTOGA (SBQ ZD 110:332) » 

TCTQT9GUITC (SEQ ZD 110:334) i 

TOTCnATCIA (SBQ ZD 110:330 t 

TOGATCTACC (SBQ ZD llO:3at) $ 

GATCTACCM! (SBQ ZD ilO»340) # 

TCTACCXCAC (SBQ ZD K0:342) ; 

TACCACACM: (SBQ id tlO:344)r 

(XACACAC91A (SBQ ZD NO:34<) ; 

ACACACAAOO (SBQ ZD NO:34«) ; 

ACACAAOOCT (SEQ ID 1]0:3S0) ; 

ACAAOaCTAC (SBQ ID 110:352); 

AASGCTACTT (SBQ ID 110:354); 

G6CTACTTCC (SEQ ID N0:3SC); 

CTACTTCCCT (SBQ ID 130:350) ; 

ACTTCCCTGA (SEQ ID N0:3€0); 

TTCCCTQATT (SEQ ID NOt3€2} ; 

CCCTGATTG3 (SBQ ID MO: 364) ; 

CTGATTGGCA (SBQ ID NO!366) ; 

GATTOaCAGA (SEQ ID NO:368) t 

TTGOCAGAAC (SBQ ID NO: 370) ; 

GOCAGAACTA (SEQ ID NO: 372) ; 




CTMkTlCACT (MQ ZD MO:2»9) # 

AAnCACTCC (SBQ ZD ■0:301) ; 

TTQUCTOCXak (SBQ ZD 110:303); 

GACfOOCMUl (OTQ ID 110:305} ; 

CTOOCiUUiaA («BQ ZD BO: 307) ; 

OOOfcKAWHa (MQ XO 1IQ:309) ; 

CAMMAAOAC (SBQ ZD 110:311) ; 

AHOMIflAaUk (BBQ ZD 110:313) ; 

OMOIIGMaA (BBQ ZD 110:315) / 
AaAGAAOATA (BBQ ZD HO: 317) ; 
ACMtOATATC (SBQ ZD BO 1 319) ; 
AMMTATOCT (BBQ ZD MO: 321) ; 
OATMNXnTTO (BBQ ZD 110:323); 
TMKSCTTGAT (BBQ ZD MO: 325) ; 
TOCTTGATCT (BBQ ZD MO:337); 
CnOATCIOr (MQ ZB 1I0:329) i 
lOATCTOIOa (fBQ ZD MOt331) ; 
AferaraCMT (BBQ ZD M0(333) ; 
CmOOATCT (BBQ lO ■0:335) ; 
tinOBliarCIAC <BBQ XB «0t337) ; 
OOAVCTACCA (BBQ ZB «0:339) ; 
AarSfBCCIWCA (BBQ a» MOi341) ; 
CTA0CACAGA (BBQ IB M0:343) ; 
AOGAGACACA (BBQ ZD MO: 345) ; 
CaiCACACAAO (BBQ ZD 11D:347) ; 
CACACAA8GC (BBQ ZD NO:349) ; 
OkCaUUKSCTA (SBQ ZD HO: 351) ; 
CAACnCTACT (BBQ ZD NO:353) ; 
AQMCTACTTC (BBQ ZD N0:355) ; 
OCmmCCC (MQ id II0:357) ; 

mcnrnzra (bbq id ciO:359)i 

CnOOCTGAT (HQ ID NO: 361); 
TCXXrraATTQ (HQ ID W0t3€3); 
OCiaATTGKK: (BBQ ID MO:365) ; 
TGtATTOOCAO (RQ ID 110:3(7) ; 
ATTOQCAGAA (SBQ ID HO:3S9) ; 
TOaCAOAACT (SBQ ID N0t371) / 
(SCAOAACTAC (SBQ ID NO:373) ; 
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G3UUUICTXCA (SEQ ID NO 1 374) ; 
GAACTACACA (SBQ ID lIO:37C); 
ACTAGACAOC (SBQ ZD 210:370)/ 
TACACUCCM (SBQ ZD HOtBSO) ; 
CACA0CAG06 (SBO ZD NO: 382) ; 
CACCAOOOCC (SBQ ZD lK>t384) ; 
CCAOaacCM (SBQ id N0:3««)/ 

AOOGCCAOOO (SBQ ID B0:3af ) ; 

QOCCMOCXfT (SBQ ZD N0:390) ; 

CCAOaaOTCA (SBQ ZD NOtSSa) ; 

AOaOGrrCAGA (SBQ ZD II0:394}; 

GGGTCAGATlk (SBQ ZD MOi396) ; 

<3TGM3A1ATC (SBQ ZD MO: 398) ; 

CAOATATCCA. (SBQ ZD NO: 400) ; 

GATATCCACT (SBQ ZD 110:402) ; 

TATCCACTOA (SSQ ZD 110:404) ; 

TCCACTOACC (SBQ ZD irO:40C) / 

CACTQACCTT (SBQ ZD NO: 408) t 

CTOACCTTTQ (SBQ ZD NO:410) i 

GACCrrraOA (SBQ id N0:412) I 
CCTTTOOAIO (SBQ ZD M0:414) | 
TTTOGATOOT (SBQ ID NOt4lC) / 
TOOATOGnaC (SEQ ZD MO: 418) ; 
GATQOTQCTA (SBQ ZD MO: 420) ; 
TOOTOCTACA (SBQ ZD MO: 422) ; 
OTQCTACAAG (SBQ ZD HO: 424) ; 
OCTACAAGCT (SBQ ZD NO:424E) ; 
TACAAOCTAG (SEQ ZD NO: 428) ; 
CAAGCTAOTA (SBQ ZD NO:430) | 
AOCTAOTACC (SEQ XD KO:432) ; 
CTAOTACCAO (SBQ 10 NO: 434) ; 
AOTACCAOTT (SEQ ID NO:436) ; 
TACCAOTTOA (SBQ ZD MOt438) i 
CCAlQTTGAaC (SBQ ID MO: 440) ; 
AOnx3AGCCA (SEQ ID NO: 442) ; 
TT(3AaCCA(3A (SBQ ID NO: 444); 
GSAGCCAGATA (SEQ ID NO: 446); 
GCCAGATAAG (SEQ ID NO: 448); 
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AOAACTACAC (SBQ ZD MO:375) } 
AACTACACAC (SBQ ZD M0:377) ; 
CnCACAOCA (MQ ZD H0:379) ; 
AGACAOCAOO (MQ ID 110:381) ; 
ACMOC^OOQC (MO ZD nO:383) ; 
AOCAOGKICCA (MQ tD »0t385) , 
CMMaOCAOa (MQ ZO B0:387) ; 
aaaOauXXXi («BQ ZB 1I0:389) ; 
OOCAOGOGnrC CMQ ZD M0:391) ; 
GAM9OTCA0 (MQ ZD MO:393) ; 
GMOmCAOAT (MQ ZD nOx395) ; 
mCAOATAT (MQ ZD HO: 397) ; 
tCMKATATOC (MQ Dl MO 1 399) ; 
AGASATOCAC (MQ ZD M0s401) ; 
ATftfOCACXO <MQ ID H0s403) ; 
AVaaACTOAC (MQ IB M0«40S) ; 
<3CAGiaACCT (MQ ZO lK>t407) ; 
ACMAOCTTT (MQ ZD 110:409) ; 

raMoanroa (mq zd K>t4ii) ; 

AQeVTVaOAT (MQ ZB ■Ot413) ; 

cnvBOAsao (mq zo mo 1 415) ; 
TVQttKtaarra (mq xd aoi4i7) ; 

<MMaaaTOCT (MO Z& »0:419); 
AIMTGCTAC (MQ ZD MO: 421) ; 
GOVOCTACAA (MQ ZD MOi423) ; 
TOeXlCAAQC (SBQ ZD MO:42S) ; 
CXACAAOCTA (SBQ ZD MO:427) ; 
ACAAOCTAOT (SBQ ZD MOt429) ; 
AAOCCAGTAC (MQ ZD MOt431} ; 
OCnMITACCA (MQ ID MO:433) ; 
rUmiOCMT (MQ ZO H0:435) ; 
GTAfiCKGTTa (MQ IB MD:437) ; 
AOCROTTGAa (MQ ID VO:439) ; 
GMrTOAaCC (SBQ ZD irOt44I) ; 
(mOMCCkS (tlQ ID HO: 443) ; 
TQMCChakT (SBQ ID MO:445) ; 
AGOCAQATAA (SBQ ID MO: 447) ; 
CCASATAASQ (SBQ ID NO:449) ; 






CAGATAAGGT (SEQ ID NO:450);
GATAAGGTAG (SEQ ID NO:452);
TAAGGTAGAA (SEQ ID NO:454);
AGGTAGAAGA (SEQ ID NO:456);
GTAGAAGAGG (SEQ ID NO:458);
AGAAGAGGCC (SEQ ID NO:460);
AAGAGGCCAA (SEQ ID NO:462);
GAGGCCAATA (SEQ ID NO:464);
GGCCAATAAA (SEQ ID NO:466);
CCAATAAAGG (SEQ ID NO:468);
AATAAAGGAG (SEQ ID NO:470);
TAAAGGAGAG (SEQ ID NO:472);
AAGGAGAGAA (SEQ ID NO:474);
GGAGAGAACA (SEQ ID NO:476);
AGAGAACACC (SEQ ID NO:478);
AGAACACCAG (SEQ ID NO:480);
AACACCAGCT (SEQ ID NO:482);
CACCAGCTTG (SEQ ID NO:484);
CCAGCTTGTT (SEQ ID NO:486);
AGCTTGTTAC (SEQ ID NO:488);
CTTGTTACAC (SEQ ID NO:490);
TGTTACACCC (SEQ ID NO:492);
TTACACCCTG (SEQ ID NO:494);
ACACCCTGTG (SEQ ID NO:496);
ACCCTGTGAG (SEQ ID NO:498);
CCTGTGAGCC (SEQ ID NO:500);
TGTGAGCCTG (SEQ ID NO:502);
TGAGCCTGCA (SEQ ID NO:504);
AGCCTGCATG (SEQ ID NO:506);
CCTGCATGGA (SEQ ID NO:508);
TGCATGGAAT (SEQ ID NO:510);
CATGGAATGG (SEQ ID NO:512);
TGGAATGGAT (SEQ ID NO:514);
GAATGGATGA (SEQ ID NO:516);
ATGGATGACC (SEQ ID NO:518);
GGATGACCCT (SEQ ID NO:520);
ATGACCCTGA (SEQ ID NO:522);
GACCCTGAGA (SEQ ID NO:524);
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AGATAAGGTA (SEQ ID NO:451);
ATAAGGTAGA (SEQ ID NO:453);
AAGGTAGAAG (SEQ ID NO:455);
GGTAGAAGAG (SEQ ID NO:457);
TAGAAGAGGC (SEQ ID NO:459);
GAAGAGGCCA (SEQ ID NO:461);
AGAGGCCAAT (SEQ ID NO:463);
AGGCCAATAA (SEQ ID NO:465);
GCCAATAAAG (SEQ ID NO:467);
CAATAAAGGA (SEQ ID NO:469);
ATAAAGGAGA (SEQ ID NO:471);
AAAGGAGAGA (SEQ ID NO:473);
AGGAGAGAAC (SEQ ID NO:475);
GAGAGAACAC (SEQ ID NO:477);
GAGAACACCA (SEQ ID NO:479);
GAACACCAGC (SEQ ID NO:481);
ACACCAGCTT (SEQ ID NO:483);
ACCAGCTTGT (SEQ ID NO:485);
CAGCTTGTTA (SEQ ID NO:487);
GCTTGTTACA (SEQ ID NO:489);
TTGTTACACC (SEQ ID NO:491);
GTTACACCCT (SEQ ID NO:493);
TACACCCTGT (SEQ ID NO:495);
CACCCTGTGA (SEQ ID NO:497);
CCCTGTGAGC (SEQ ID NO:499);
CTGTGAGCCT (SEQ ID NO:501);
GTGAGCCTGC (SEQ ID NO:503);
GAGCCTGCAT (SEQ ID NO:505);
GCCTGCATGG (SEQ ID NO:507);
CTGCATGGAA (SEQ ID NO:509);
GCATGGAATG (SEQ ID NO:511);
ATGGAATGGA (SEQ ID NO:513);
GGAATGGATG (SEQ ID NO:515);
AATGGATGAC (SEQ ID NO:517);
TGGATGACCC (SEQ ID NO:519);
GATGACCCTG (SEQ ID NO:521);
TGACCCTGAG (SEQ ID NO:523);
ACCCTGAGAG (SEQ ID NO:525);






CCCTGAGAGA (SEQ ID NO:526);
CTGAGAGAGA (SEQ ID NO:528);
GAGAGAGAAG (SEQ ID NO:530);
GAGAGAAGTG (SEQ ID NO:532);
GAGAAGTGTT (SEQ ID NO:534);
GAAGTGTTAG (SEQ ID NO:536);
AGTGTTAGAG (SEQ ID NO:538);
TGTTAGAGTG (SEQ ID NO:540);
TTAGAGTGGA (SEQ ID NO:542);
AGAGTGGAGG (SEQ ID NO:544);
AGTGGAGGTT (SEQ ID NO:546);
TGGAGGTTTG (SEQ ID NO:548);
GAGGTTTGAC (SEQ ID NO:550);
GGTTTGACAG (SEQ ID NO:552);
TTTGACAGCC (SEQ ID NO:554);
TGACAGCCGC (SEQ ID NO:556);
ACAGCCGCCT (SEQ ID NO:558);
AGCCGCCTAG (SEQ ID NO:560);
CCGCCTAGCA (SEQ ID NO:562);
GCCTAGCATT (SEQ ID NO:564);
CTAGCATTTC (SEQ ID NO:566);
AGCATTTCAT (SEQ ID NO:568);
CATTTCATCA (SEQ ID NO:570);
TTTCATCACG (SEQ ID NO:572);
TCATCACGTG (SEQ ID NO:574);
ATCACGTGGC (SEQ ID NO:576);
CACGTGGCCC (SEQ ID NO:578);
CGTGGCCCGA (SEQ ID NO:580);
TGGCCCGAGA (SEQ ID NO:582);
GCCCGAGAGC (SEQ ID NO:584);
CCGAGAGCTG (SEQ ID NO:586);
GAGAGCTGCA (SEQ ID NO:588);
GAGCTGCATC (SEQ ID NO:590);
GCTGCATCCG (SEQ ID NO:592);
TGCATCCGGA (SEQ ID NO:594);
CATCCGGAGT (SEQ ID NO:596);
TCCGGAGTAC (SEQ ID NO:598);
CGGAGTACTT (SEQ ID NO:600);




CCTGAGAGAG (SEQ ID NO:527);
TGAGAGAGAA (SEQ ID NO:529);
AGAGAGAAGT (SEQ ID NO:531);
AGAGAAGTGT (SEQ ID NO:533);
AGAAGTGTTA (SEQ ID NO:535);
AAGTGTTAGA (SEQ ID NO:537);
GTGTTAGAGT (SEQ ID NO:539);
GTTAGAGTGG (SEQ ID NO:541);
TAGAGTGGAG (SEQ ID NO:543);
GAGTGGAGGT (SEQ ID NO:545);
GTGGAGGTTT (SEQ ID NO:547);
GGAGGTTTGA (SEQ ID NO:549);
AGGTTTGACA (SEQ ID NO:551);
GTTTGACAGC (SEQ ID NO:553);
TTGACAGCCG (SEQ ID NO:555);
GACAGCCGCC (SEQ ID NO:557);
CAGCCGCCTA (SEQ ID NO:559);
GCCGCCTAGC (SEQ ID NO:561);
CGCCTAGCAT (SEQ ID NO:563);
CCTAGCATTT (SEQ ID NO:565);
TAGCATTTCA (SEQ ID NO:567);
GCATTTCATC (SEQ ID NO:569);
ATTTCATCAC (SEQ ID NO:571);
TTCATCACGT (SEQ ID NO:573);
CATCACGTGG (SEQ ID NO:575);
TCACGTGGCC (SEQ ID NO:577);
ACGTGGCCCG (SEQ ID NO:579);
GTGGCCCGAG (SEQ ID NO:581);
GGCCCGAGAG (SEQ ID NO:583);
CCCGAGAGCT (SEQ ID NO:585);
CGAGAGCTGC (SEQ ID NO:587);
AGAGCTGCAT (SEQ ID NO:589);
AGCTGCATCC (SEQ ID NO:591);
CTGCATCCGG (SEQ ID NO:593);
GCATCCGGAG (SEQ ID NO:595);
ATCCGGAGTA (SEQ ID NO:597);
CCGGAGTACT (SEQ ID NO:599);
GGAGTACTTC (SEQ ID NO:601);






GAGTACTTCA (SEQ ID NO:602);
AGTACTTCAA (SEQ ID NO:603);
GTACTTCAAG (SEQ ID NO:604);
TACTTCAAGA (SEQ ID NO:605);
ACTTCAAGAA (SEQ ID NO:606);
CTTCAAGAAC (SEQ ID NO:607);
TTCAAGAACT (SEQ ID NO:608);
TCAAGAACTG (SEQ ID NO:609);
CAAGAACTGC (SEQ ID NO:610);
AAGAACTGCT (SEQ ID NO:611);
AGAACTGCTG (SEQ ID NO:612);
GAACTGCTGA (SEQ ID NO:613).



of the following decanucleotides from the nef gene of HIV-1 NL4-3:



ACCAGCTTGT [SEQ ID NO:803];
CCAGCTTGTT [SEQ ID NO:804];
CAGCTTGTTA [SEQ ID NO:805];
AGCTTGTTAC [SEQ ID NO:806];
GCTTGTTACA [SEQ ID NO:807];
CTTGTTACAC [SEQ ID NO:808];
TTGTTACACC [SEQ ID NO:809];
TGTTACACCC [SEQ ID NO:810];
GTTACACCCT [SEQ ID NO:811];
TTACACCCTG [SEQ ID NO:812];
TACACCCTGT [SEQ ID NO:813];
ACACCCTGTG [SEQ ID NO:814];
CACCCTGTGA [SEQ ID NO:815];
ACCCTGTGAG [SEQ ID NO:816];
CCCTGTGAGC [SEQ ID NO:817];
CCTGTGAGCC [SEQ ID NO:818];
CTGTGAGCCT [SEQ ID NO:819];
TGTGAGCCTG [SEQ ID NO:820];
GTGAGCCTGC [SEQ ID NO:821];
TGAGCCTGCA [SEQ ID NO:822];
GAGCCTGCAT [SEQ ID NO:823];
AGCCTGCATG [SEQ ID NO:824];
GCCTGCATGG [SEQ ID NO:825];
CCTGCATGGA [SEQ ID NO:826];
CTGCATGGAA [SEQ ID NO:827];
TGCATGGAAT [SEQ ID NO:828];
GCATGGAATG [SEQ ID NO:829];
CATGGAATGG [SEQ ID NO:830];
ATGGAATGGA [SEQ ID NO:831];
TGGAATGGAT [SEQ ID NO:832];
GGAATGGATG [SEQ ID NO:833];
GAATGGATGA [SEQ ID NO:834];
AATGGATGAC [SEQ ID NO:835];
ATGGATGACC [SEQ ID NO:836];
TGGATGACCC [SEQ ID NO:837];
GGATGACCCT [SEQ ID NO:838];
GATGACCCTG [SEQ ID NO:839];
ATGACCCTGA [SEQ ID NO:840];
TGACCCTGAG [SEQ ID NO:841].









110. A method according to claim 103 wherein said deletion encompasses at least one
of the following decanucleotides from the LTR region of HIV-1 NL4-3:

GCTTTTTGCC (SEQ ID NO:652);
CTTTTTGCCT (SEQ ID NO:653);
TTTTTGCCTG (SEQ ID NO:654);
TTTTGCCTGT (SEQ ID NO:655);
TTTGCCTGTA (SEQ ID NO:656);
TTGCCTGTAC (SEQ ID NO:657);
TGCCTGTACT (SEQ ID NO:658);
GCCTGTACTG (SEQ ID NO:659);
CCTGTACTGG (SEQ ID NO:660);
CTGTACTGGG (SEQ ID NO:661);






TOTACTGCKaT 


(SEQ 


ID 


ilOtSC2) 1 


TACraOSTCT 


(SBQ 


ID 


110:664) 1 




(SBQ 


ID 


no* 666) ; 


SOOTCTCTCT 


(SBQ 


ID 


H0t668) 1 


(STCTCTCTQO 


(SBQ 


ID 


110:670); 


CTCTCTOOTT 


(SBQ 


ID 


BO: 672) ; 


CTCrOOTTAa 


(SBQ 


ID 


N0:674) ; 


CTaOTTAQAC 


(SBQ 


ID 


M0:676)i 


GGTTAOACCIl 


(SBQ 


ID 


110:678) ; 


TTAOACCAOA 


(SBQ 


ID 


NO:6aO) 1 


AOAiCCIUSATC 


(SBQ 


ID 


ilOt6«2) ; 


ACX»aATCTG 


(SBQ 


ID 


NOt684) / 


CAaATCTGMG 


(SBQ 


ID 


UOtSBO t 


GATCTGJU3CC 


(SBQ 


ID 


N0:6Ba) ; 


TCTGAGCCTQ 


(SBQ 


ID 


MO:6»0) ; 


TGJUSCCTOGO 


(SBQ 


ID 


110:6f2) t 


juaccTOOOAa 


(SBQ 


ID 


llOt6»4) ; 


cxnaooAser 


(SBQ 


ID 


llOt6»6) 1 


TOaaAOCTCT 


(SBQ 


ID 


lR>t6»8) t 


GOAaCTCTCT 


(SBQ 


ID 


IIO<700) ; 


JWQCTCTGTOa 


(SBQ 


ID 


110:702) ; 


CTCTCTOaCT 


(SBQ 


ID 


TO: 704) 1 


CTCTQOCTAA 


(SBQ 


ID 


110:706) ; 


CTOOCTAACT 


(SBQ 


ID 


110:708) ; 


QGCTAACTam 


(SBQ 


ID 


MO S # xU| 1 






TD 






(SBQ 


ID 


MO: 714) t 


CTAGCK3AACC 


(SBQ 


ID 


MO: 716) ; 


ACK36AXCCCA 


(SBQ 


ID 


110:718) 1 


OOAACCCACT 


(SEQ 


ID 


MO:720)} 


AACCCACTGC 


(SBQ 


ZD 


NOt722); 


CCCACTGCTT 


(SEQ 


ID 


110:724) I 


CACTOCTTAA 


(SBQ 


ID 


NO: 726) ; 


CTGCTTAAOC 


(SBQ 


ID 


NO 1728) ; 


GCTTAAOCCT 


(SEQ 


ID 


NO: 730) ; 


TTAAQCCTCA. 


(SEQ 


ID 


NO: 732) ; 


AAQCCTCAAT 


(SEQ 


ID 


NO: 734) ; 


QCCTCAATAA 


(SEQ 


ID 


NO: 736) ; 



OTHCNSOGTC 


(SBQ 


ID 


1K>:663) ; 


ACKMRrrCTC 


CBBQ 


ID 


liO:66S) ; 


naooRFCicTC 


(BBQ 


ID 


MOt667) ; 


ooicvcTcxa 


(m 


ZO 


BOt669) ; 


ICICTCTGOT 




ID 


M0:671} ; 




(SBQ 


ZO 


110:673} ; 




(SBQ 


ZD 


110:675) ; 


'iHAITTAOACSC 


(«BQ 


ID 


BO: 677) ; 




(SBQ 


XO 


110:679) ; 


iMAaawoAT 


(SBQ 


ZO 


110:681) ; 


(3AOCMATCT 


(SBQ 


ZD 


llOi683) ; 


OGaaArcvaii 


(SBQ 


ID 


llOiC85) ; 


jusmtcigmk; 


(SBQ 


ID 


110:687) ; 


AtCIOMCCT 


(SBQ 


B> 


110:689) / 


craROCCToa 


(SBQ 


J» 


MO:691) ; 


CMOCCraGOA 


(SBQ 


ZD 


M0:693) t 


aoBftooshac 


(MQ 


ZO 


110:695) ; 


CtOOGMGCTC 


(SBQ 


ID 


110:697) ; 


(MOAlOGTCTC 


(SBQ 


Tb 


110:699) t 


OMCTCTCTO 


(SBQ 


m 


110:701) ; 


QfSKXCSQQC 


(SBQ 


ZD 


M0:703) 1 


VCVCTOQCTA 


(SBQ 


n> 


110:705) ; 


ICIOOCTAAC 


(SBQ 


ZD 


110:707) ; 


TQdCTAACTA 




ZD 


Bam . *VACk \ • 

WO: Yu9| ; 










TAACTAQaOA 


(MBQ 


ID 


N0:713) ; 


ACTA/aaOAAC 


(SBQ 


ZD 


NO:71S) t 


XIAaOOAACOC 


(SEQ 


ZD 


NO t 717) ; 


doaiuuxscAC 


(SBQ 


ZD 


110:719) ; 


CHUkGCXyVCTO 


(SBQ 


ID 


N0t721) ; 


M30CACT0CT 


(SBQ 


1» 


M0t723)t 


CabCTCKTTTA 


(SBQ 


ZD 


lK>t72S) ; 


ACVOCTTAAO 


(SBQ 


ZD 


80:727) } 


TOCTTAAGCC 


(SBQ 


ZD 


NO 3 729) ; 


CfnUbQCCTC 


(SEQ 


ZD 


NO:73l) ; 




(SBQ 


ID 


MO:733) ; 


AaCC:TCAATA 


(SBQ 


ID 


NO: 735) t 


CCTCAATAAA 


(SEQ 


ID 


N0:737) ; 



SENT BY:Davles Colllson Cave ; 6- 6-35 ; 18:47 



MELBOURiNE- 



516 742 4366 : #00/3' 



CTCJUITAAAG (SBQ ZD 110:738); 

CAXTAAAOCT (SBQ ZD 80:740); 

AXAAAOCTTO (SBQ ZD N0:742); 

AMUICTTGOC (SEQ ZD 110:744); 

AOCTTOCCTT (SBQ ZD N0t746); 

CrCQCCnGSK (SBQ ZD llOi748)/ 



TOCCTTOMT 


(SBQ 


ZD 


HO:7S0) ; 


CCTTGUU3TOC 


(SSQ 


ZD 


MO«7$2) ; 


TTGUySTGCTT 


(SBQ 


ZD 


IK)t754) ; 


GA8TGCTTCA 


(SBQ 


ZD 


llOt75C); 


GTGCTTCAAa 


(SBQ 


ZD 


llOi7Sa) ; 


OCTTOUbSTA 


(SBQ 


ID 


NOt7C0) ; 


TTCAAOCMST 


(SBQ 


ZD 


N0:7€a) ; 


CAJkOTAaTOT 


(SBQ 


ZD 


)iO:7€4) ; 


AGTAOTCnXIT 


(SBQ 


ZD 


Il0i7€€)| 


TAGrGrTOTOC 


(SBQ 


ZD 


N0t7<t) ; 


OTOTOTQCCC 


(SBQ 


ZD 


N0i770) 1 


OTOTOCcoGrr 


(SBQ 


ZD 


110:772) 1 


GTGOCXSOTCr 


(SSQ 


ZD 


110(774) f 


GOCOOTCTQfT 


(SBQ 


ZD 


N0>77«) 1 


GOOtCTOTTS 


(SBQ 


ZD 


llOt778) 1 


GTCTOTTOTO 


(SBQ 


ZD 


110:780) ; 


CTGTTQTSra 


(SBQ 


ZD 


110:783) ; 


CrrTGIGTOAC 


(SBQ 


ZD 


190:784); 


TQTQTGACTC 


(SBQ 


ZD 


H0t78C)f 


TaTOTGXCTC 


(SBQ 


ZD 


110:788); 


TQTGHCTCTa 


(SEQ 


ZD 


NO: 790) ; 


7C3ACTCTOGT 


(SBQ 


XO 


NOt7»a) ; 


ACTCraOTAA 


(SBQ 


ID 


N0:7»4); 


TCTGOTAACn^ 


(SEQ 


ID 


1I0:79€); 


TGQTA&CTAa 


(SBQ 


ID 


NO: 796) ; 



281 . 

TCMTAAAOC (SBQ ZD BOtVaS) ; 

AXTMUWSCTT («BQ ID lK>t741) ; 

TMkAOCnaC (SBQ ZD M0t743) ; 

JaOKTZGCCT (C8Q ID 1K>:745); 



oenoocTTa 




* ZD Il0t747) ; 


nOOCTtCMO 


(BBQ 


ZD 


1 »D:749) i 




(MDQ 


ZD 


NO (753.) ; 


CTTOMROCT 


C«BQ 


ZD 


110:753} ; 


TGMffldCTTC 


mo 


ID 


110:755) ; 




(BBQ 


ID 


MO: 757) ; 


TBCnCJUkCST 


(MQ 


m 


MOt759) ; 


CTTCMUlTAdI 


(CBQ 


ZD 


1K>:7«1} ; 




l«Q 


ZD 


lK>f70); 




(«HQ 


ZD 


liOt765) ; 




(MQ 


to 


MOt7«7) ; 


AfllXRGTGOC 


<BBQ 


ID 


IR>:7«9) ; 


ravavooocn 


(BBQ 


ID 


110:771) ; 


TSfOOOOGTC 


(BBQ 


XO 


110:773) ; 


TttexKOTcro 


(«BQ 


zo 


110:775) ; 




(BSQ 


XO 


110:777) ; 




C«BQ 


ZO 


«0:779) ; 




<mQ 


ZO 


110:781) ; 




(CBQ 


zo 


110:783) ; 




(«BQ 


ID 


HO:785) ; 


aiQfGSftCTCT 


(SBQ 


zo 


110:787) ; 


OTOTdACTCT 


(SBQ 


ZO 


llOt789) ; 


oraACTCixaa 


(SBQ 


zo 


110:791) ; 


OMcrcTOorrA 


(SBQ 


ZD 


MO:793) ; 


CTGtaaTAAC 


(•BQ 


ID 


110x795) ; 




(•BQ 


ID 


80:797) ; 


(i(niuu:;xAa\ 


(SBQ 




NO:79») . 



111. A method for determining the pathogenicity of an HIV-1 strain after said HIV-1

strain infects cells of an individual, said method comprising contacting a biological
sample from said individual with a peptide corresponding to a deleted or truncated
region of an HIV-1-derived protein and screening for the absence of antibody binding
to said peptide, wherein the absence of antibody binding is indicative of a deletion or




f1l>MIMi1lHH»JIIV|.U>.^ 

t».nr«ri/>n i» ♦h.t p^,;^ -flirifhr <if ihf mm iiki^i ah lt> uf mU ilimu 

«fHIV-l. 

112. A method according to claim 111 wherein [...]

113. A m^rtt~< 4^ 11^ -*iirni n Hii^ ptpMt i^im m at ii miiili: . 

amino acids 162-177 of Nef from HIV-1 NL4-3.

114. A method for determining the pathogenicity of a strain of HIV-1 after said HIV-1
strain infects cells of an individual, said method comprising contacting a biological
— .pi^ * — i^^i^^t ^-.^ ^ rflinrfii Ki ammm* nf ■ pipiilt Iwwtat tii wiimu ilIJ 
aoqucaoe conpridng or vdiidA «iDiM odd! 1^ 

<xmtMt being for • time and uDder ooodffiaM MrfBeknt 

kt^,>j&^i *- ^hnr r rnmj[ilair nrttfi mU pi|iHiIu and 1^— donaihn ihc pi^ n m rr 

of laid oompkx wfaerdntlieabaeaaeofaooaaplKfiiMtodMiwi innf iiitti i fi TIT; 
1 t^M^.^ ..-^ f -ifiti ■ mm irfiiif iiiifii »■!■ Liflin- 1 

115. A method for determining the pathogenicity of an HIV-1 strain after said HIV-1
strain infects cells of an individual, said method comprising determining the presence
of a mutation in the genome of said HIV-1 wherein the presence of the mutation results
in a Nef protein substantially lacking amino acids 162 to 177 of Nef of a wild-type
strain of HIV-1 and wherein said mutation is indicative of a non-pathogenic strain of
HIV-1.

116. A method according to claim 115 wherein the presence of the mutation is
indicated by a substantial absence of binding [...]

protein defined by amino acids 162 to 177 of wild-type HIV-1 Nef protein.

117. A synthetic peptide comprising a sequence of amino acids defined by SEQ ID
NO:801 or a part or fragment thereof.



118. Antibodies to the synthetic peptide defined in claim 117. 




119. A method for determining the risk of an individual seropositive for HIV-1
developing symptoms of AIDS, said method comprising contacting [...] from said
patient with a synthetic peptide defined by SEQ ID NO:801 or a part or derivative

A^,,^ ui^At^ -'rmflrnittM In wliliifplHii -^Tniini ihn ■nh **— *'«t 

absence of antibodies to said peptide is indicative of a low risk of the individual
developing AIDS.






ABSTRACT 

The present invention relates to non-pathogenic strains of HIV-1 and to components,
parts, fragments and derivatives thereof and to genetic sequences derived therefrom and
their use in the development of diagnostic and therapeutic compositions for the treatment
and prophylaxis of AIDS and AIDS-related disorders. The present invention also relates

to a method for attenuating pathogenic strains of HIV-1 by mutagenizing particular
regions of the HIV-1 genome.






MM 

GAACAGATTIGGAATAACATGACCTGGA'tGGAGTGGGACAGAGAiUVTTAA 8121 

*** »*'t***'«4r * ******** *i ** ******* * ******* * 

GAAGAGATTTGGGAGAACATGACCTGGATGCAGT6G6AAAAAGAAATTCA 
*** ****** *********ii** ***** ******* ********* * 

GAA^CAATTTGGGATAACATGACCTGGATCCACTGGOAAAGAGAAATTGA 
*** ****** ****************** ******* ********* * 

GAAACAATTTGGGATAACATGACCTGGATGCAGTGGGAAAGAGAAATTGA 

******** 

GAAATTAA 



^'^''^'^^r^i^i^^ui^j^^ B171 

ff?n?S?5?if?^I??r?*^CACCTTM^ 

mmmmwm****** ****•♦♦*♦**♦*♦**♦♦ * 

CAATTACACAAGATTAATATACAACTTAATTGAAGAATCGCAGAACCAAC 



AAGAAAAGAATGAACAAGAATTATTGGAATTAGATAAATGGGCAAGTTTG 822 1 

AAGAAAACaVATGAACWlGAACTATTGGAATTGGATCAATGGGCAAGTTTG 

AAGAAAAAAATGAACTAGAATTATTGGAATTGGATAAATGGGCAAATTT6 

AACAAA7UIAATGAACTAGAATTATT6GAATTGGATAAATGGGCAAATTTG 

AAGAAAAGAATGAACAAGACITATTGGAATTAGATAAATGGGCAAGTTTG 



TGGAArTGGTTTAACATAACAAATTGGCTGTOGTATATAAAATCATtCAT 6271 

AAA«A4««4« « « A A 4 4 « 4 A 4 « 4 4 « 4 A 4 44AA444 

TGGAATTGGTTTGACATAACAAAATGGCTGTGGTATATAAAAATATTCAT 
TGGAATTGGTTTA0TATATCAAACTGGCTAT66TATATAAAATTATTCAT 

♦♦♦♦♦♦♦♦♦♦♦♦♦♦ ♦♦♦♦ 

TGGAATTGGTTTAfiTATATCAAACTGGCTATGGTATATAAAATTATTCAT 
TGGAATTGGTTTGACRTAACAAGTGGGCT6TCGTATATAAAATTATTCAT 



AATGATAGTAGGAGGCTTGGTAGGTTTAAfiAATAGTTTTTGCTGTACTTT 8321 

AATGGTAGTAGGAGGCTTGATAGGTl'TAACAATAGTTrTTGCTGTACTTT 

AATGGTAGTAGGAGGCTTGGTAGGITTAAGAATAGTTTTTACTGTACTTT 
**** ft******************'*'*^*****^******* 

AATGCTAGTAfiGAGGCTTGGTAGGTTTAAGAATAGTTITTACTGTACTTT 

**************** *«-«4-<«.«««'4-«'«*«*««4 

AAT6ATAGTAGGAGGCTTGGTAGGTTTAAGAATA6TTTTAGCTGTACTTT 



SA8 SA9 9A10 
Cl*A'rACTCAnTAcJvCTT3vCCCACCCATATTavCC7lTTATCCTTTC?S^ »371 
CTATAGTGAATAGAGTTAGGCAGGGArACrCACCATTGTCGTTTCAOACC 
CTATAGTTAATAC5AGTTAGGCAGGGATACTCACCATTATCGTTTCAGACC 
CTATA6TTAATAGAGTTAGGCAGGGATACTCACCATTATCGTTTCAGACC 
CTATAGTGAATAGAGITAGGCAGGGATACTCACCAITATCGTTTCAGACC 




Tat termination NL43

CACCTCCCAATCCCGAGGGGACCCGACAGGCCCGAAGGAATAGAAGAAGA 842 1 

CTCCTCCCAACCCCGAGGGGACCCGACAGGCCCGAACGAATCGAAGAACyi 
********** ***** *************** ******** ******** 

CACCTCCCAACCCCGAACGGACCCGa\CAGGCCAfiAAG6AATCGAAfiAAGA 

********** ******** ******** 

CACCTCCCAACCCCGAGGGGACCCGACAGGCCAGAAGGAATCGAAfiAAGA 

••*•••*♦♦♦ ♦♦♦♦♦•♦•♦♦************ A44i**** ******** 

CACCTCCCAACCCCGAGGGGACCC6ACAGGCCCGAAGGAATCGAAGAAGA 



AGGTGGAGAGAGAGACAfiAGACAGATCCATTCGATTAGTGAACGGATCCT 6471 
*************m*t,t,^<^t*******^^, ****^t**** ****** ** 

AC«TGGAt»GACAGACAGAGACAiC»TCau:TCCATTACTACA^^ 
*****************,*** *******7** 



AGGtGGAGAGAGAGACAGAGACAGATCCAGTCGATTAGTGCACGGATTCT 

C18S, C18M & C98H Tat termination






TAGCACTTATCTGGGACGATCTCJCGGAGCCTGTGCCTCTTCAGCTACCAC 8S2 1 
******** ********** *** *************** «k « ******* 

TAGCACTTTTCTGGGACGACCTGA6GAGCCTGTGCCTCTTCCTCTACCAC 
******** ********** *** *♦** «****♦****♦**♦•**♦•♦♦ 

TAGCACTTTTCTGGGACGACCTGAGGAGTCTGTGCCTCTTCAGCrACCAC 
******** ****** «** ♦*** ******♦**♦*♦*♦♦♦*♦•♦♦ 

TACCACTTTTCTGGGTCGACCTGAGGAGTCTGTGCCTCTTCAGCTACCAC 

******** ****** *** *** ***4 ****<t^«*^**********^*** 

TAfVy^CTTTTCTGGGTCGACCTGAGGAGCCTGTGCCTCTTCAOCTACCAC 



CGCTTGAGAGACTTACTCTTGATTGTAAC6AGGATTGTGGAACTTCTGGG 8571 

* *************************** ******************** 

CACTTGAGAGACTTACTCTTGATTGTAACAAGGATTGTGGAACTTCT6GG 

* ************************************************ 

CACTTGAGAGACTTACTCTTGATTGTAACGAGGATTGTGGAACTTCTGG6 
************************************************** 

CGCTTGAeAGACTTACTCTTGATTGTAACGAGGATTGTGGAACTTCTGGG 
CGCTTGAGAGACTTACTCTTGATTGTAACGAGGATTGTGGAACTTCTGGG 



ACGCAGGGGGTGGGAAGCCCTCAAATATTGaTGGAATCTCCTACACTATT 8 62 1 

ACGCAGG<MATGGGAAGCCCTCAAATATTGGTGGAACCTCCTAAAGTATT 

ACGCA6GG6ATG6GAAGCCCTCAAATACTGG1GGAATCTCCTGCAGTATT 
***************** ************** ******* 

ACGCGGGGGATGGGAAGCCCTCAAATACTGGTGGAATCTCCTGCAGTATt 
********************************************* **** 

ACO'CiUjO'UUUTUGGAAGCCCTCAAATATTGGTGGAATCTCCTACAXTATT 



NL43 Rev termination

GQAGTCAGGAACTAAAGAATAGTGCTGTTAACTTGCTCAATGCCACAGCC 8671 
*♦*♦ **♦♦*♦♦♦ **** ********** *** 

fSGAGCCAGGAACTGCAGAAGAGTGCTGTTATCTTGCTCAATGCCACCfiCC 

**** ***!****** **** ********** tA* *** 

GGAG6CAGGAACTACAGAKGAGTGCTGTTAGCTTGTTCAATGGCACC6CC 

**** ********* **** ***'4i* **** ************** 

GGAGACAGGAACTACAGAAGAGTGCAGTTAGCTTGTTCAATGCCATAGCC 
************* ***** ****** ********** *** 

GGAGTCAGGAXCTCAAGAAGAGTGCTATTAGCTTGTTCAATGCCACCGCC 

C18S, C18M & C98H Rev termination






ATAGCACTAGCTGAG6GGACAGATACGGTTATAGAAGTATTACAAGCAGC 8721 

•ir****4**««*«r«r«««4c«4r4 4« **•* ********* 4 « *4* 

ATAGCACSTAGCTGAGGGGACAGATAGAGTTTTAGAAGTATTACAAAGAGC 
******************** ik****« ********** **** * *« 

ATAGCAGTAGCTOAGGGGArAGATIlGAGTTATAGAAGCTTTACCAAGGGC 
♦****♦♦♦♦*♦♦♦♦♦♦♦***♦*♦*** * ♦*♦**♦«* ******* *** 

ATAGCA6TAGCTGAGGGGACAGATAGAGCTATAGAAGCATTACAAACACC 

«'4^**********4*****«^*4t»*** ****************** *«* 

ATAGCAGTAOCTGAGGGGAnAGATAGAGTTATAGAAGTATTACAAAGAGC 

«»^«#,.s<<«<<» » AGGGGACAGATAGAGTTCTAGAAGTATTACAAAGAfiC 
D36P Rev termination



TTATAGAGCTATTCGCCACATACCTAGAAGAATAAGACAGGGCTT6GAAA 8771 
TTATAGA6CTATCCTCCACATACCTAGAAGAATAAGACA66GCCTCGAAA 

*********<**«** **********«**«***«***44«***«** *«*« 

TTATAGAUCtATTCU'CCACATACCTAGAAGAATAAGACAGGGCTTAGAAA 

*'«**«**4*****« ***««A**««*««**««*«**«*«***«** **«* 

TTATAGAGCTATTCTCCACATACCTAGAAGAATAAGACAG6GCTTAGAAA 
TTGTAGAaCTQTTCTCCACATACCTAGAAGAATAAGACAGGGCTTCGAXA 
TTATAGAGCCATTCTCCACATACCTAGAAGAATAAGACAGGGCTTCGAAA 



Env termination nef start

GGATTTTGCTATAAiGATGGaTGGCAAGTGGTCAAAAAGTAGTGTGATTGG 8821 

TGGCTTTGCTATAAARTGGGTGGCAAGTGAGCAMAAGTAGTGTAGTCAG 
** V* ** * 

GGGCTTTGCTATAAAATGGGTGGCAAGTGGTCAGAAAGTAGTGTGCTTAG 

GGGCmOCtGTAA AATG GG 

GSGCTATGCTATAAAATGGGTGGCAAGTGGrTAAAAAGTAGTATGGTTAg 

TGGCTTTNCTATAAAATGGGTGGCAAGTGGGCAAAAGGTAGTGTACTTAG 

D36P Nef termination



nef duplication region
ATGGCCTOCTGTAAGGGAAAGAATGAGACfiACCTGAOCCAGCAGCACATG »87 1 

4* «* 

ATACCATG— — — — — * CATCATAAG 

it 

AAGGCATG 

ATCGCXTGCTCTAAGGGAAAAAATGAAACAAGCTGAGCCAGCACa^JAAG 
ATGGCAGGCrGTAAGGGAAAAA-TAAAACGAGCTGAGCCAGCAGCACAAG 



GGGTGGGAGCAGTATCTCGAX5ACCTAGAAAAACATGGAGCAATCACAAGT 892 1 

GGGTGGGGGC 



G6GTGGGAGCAATATCTCGAGACCTAGGAAAACATGGAGCAATCCCAAGT 
G6GTGGGAGC 



AGCAATACAGCAGCTAACAATGCTGCTTGTGCCTGGCTAGAAGCACAAGA 8 971 
CAACAACTAACAATGCTGATCGTGCCTGGCTAGAAGCACAAGA 



«4 *********** ************************ 

AGCAATACAACAACTAACAATGCTAATTGTGCCTGGCTAGAAGCACAARA 
** ** ******** *** ******************* 

^CAACAACTAACAATTCTGGTTOCGCCTGGCTAGAAGCA— — 




GGAGGAAGAOGTGGGTTTTCCAGTCACACCTCACCTACCTTTAAGACCAA 902 1 

GAAGGAAGIVAGCGGGTTTTCCAGTCAAACCTCAGGTA 

— TACCTTTAAGAC— - 



****** ** *********^**it*** *********************** 

GGAGGACGAAGTGGGTTTTCCAGTCAAACCTCAGGTACCTTTAAGACCAA 



Polypurine tract

TGACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAagGGG 9071 
****** **************'H,n,**'^^**w*-^ii 

— GCTGTAGATCTTAGCCACfTTTT AAAhGAAAAGGGG 

********* ******A-***** *********m****i'l * Vir' 

• AAGGCAGCTAYAGATC TTAG CCGCTTTTT AAAAGAAAAGGGC 



•GATCTTAGCCACTTTTTAAAAGAAAA< 



TGACTTACAAG GCC31CTTT TTAAAAAGAAAGGGG 

*******^*^** il****i 



— CACTTTT TAAAAAGAAAGGGG 

C18S nef termination

C18M and C98H nef termination






_J U3 

GGACTCGAAGGGCTAATTCACTCCCAAAGAAGACAAGATATCCTTGATCT 9121 

GGACTOOAAGGGCTAATTCACTCCCAAAGAAGACAAGATA 

CGACTGGAAGGGCTAATrcy^CTCACACAGAAGA-' 

GGACTGGAAGGGCTAATTCACTCACAGAGAAGA 

¥T************** ***♦**♦♦ ♦*♦*♦♦•*♦♦«*♦♦♦**♦♦♦♦•«*♦ 

GGAClGGAAGGGCTAATrcyiCTCCTAAAGAAGACAAGATATCCTTGATCT 
GGACTdGAAGGGCTAATACGCUCCCAAAGAAGACAAGATATCCTTGATCT 



GTGGATCTACCACACACAAGGCTACTTCCCTGATTGGCAGAACTACACAC 9171 



**«*•••*•«*«**«'*«•**«*«** 

TTCGATCtACCACACACAAGGCTACl' 

6T0GGGCTACCACACACAAGGCTACTTCCCTGATTGGCAGAACGACACAC 






{ . .dyad symmetry. .
{. .NRT-1 

t NRE — > 

CAGGGCCAGGGGTCAGATATCCACTGACCTTTGOATGGTGCTACAAaCTA 922 1 

* *** 

CACA6TGCT6CAAACTA 



********** ***** ******* *** ** 
ATCCACTCACTTTTCG, TGGTGCTTCAAATTA 

tt****4ttri,*** ** 4,****************** t****** * *** * * 

CAG6GCCAG66ACCAGATATCCAC1'GACCTTTCCATGGT6CTGCAAACGA 



• • • m 9 } 

J 

»yj> wr-AT 

CTACCAGTtGAGCCAGArAAGGTAGAAGAGGCCAATAAAGGAGAGAACAC 9271 
******* **tt *** ****************** ****** *«* * 

TA CCAGTG GAGTCAGCGAAGATAGAASAGSCCAATGGA6CACAAAACCA 
****** * * * * * ** **« *** **** 

>«- TCAGTTGA ACCAGAAGAAGATA6AAGAGGCCATGAAGAAGAAIUlCAA 
*•♦•♦•* ****« * * • * * ** *** *«* **** 

^-- TCAGTTG AACCAGAAtSAAGAATQAAgAgCCCaTgAACAJlGAAAACAA 
** ***** ** ******* ***** *********** 

6TACCA6TGGANCCAGA--AGAaAGAAGAGACCAATGGAGGAGAGAACA- 
******* It* **** *** ************** ****** **** 

GTA CCAGGGG AAACAGAGAAGATAGAAGACGCCAATGCAGGAGAAAACAA 
(nybt 



VF-AT 

CAGCTTGTTACACCCTGTGAGCCTGCATGGAATGGATCACCCTGAGACAG d 32 1 

CAGATT6TT 

CAGAXTGTT 

**« ***** 

CAGATTGTT ^ 



* * * **«* 

CAGACTGTT 




9371 



* * « 

CCGTTTGTT 
** 

-CT6CT 




TCP- la 



^M«£ tenninacien 

CGAGACCTGCATCCGGAGTACTTCA AGAACTGCTGACATCGAGCTTGCTA 9421 

* * * * * It ■* * « 

CTGTT GGGGACTTTCCA TCCGTTG G6GACTTTCCA A6GCSGCGTGGCCTG 

* * * ★ * *« 

CCGTT GGGGACTTTC CA, ,,,,,, GGAGACGTGGCCTGAGTGATAAGCCG 
♦ ***♦ ♦ 

TGCTCAGCT GSGGACTTTCC AgAAGGCGCGGCCTGAGTGACTAAGCCCCG 

CAGAGTGT GGGGACTCTCC ACAACAGAGTGT GGGGACTTTCCA AGGACGC 
* ♦ * * * * * 

— ^-CCGTTGGGGAgTTTCCAAGGAGGCGTGGCCTGAGTGACTAAGTTCC 
V3€f, C1»S, CISM C cHh extrs NFAD 

D36P c C9eH Bxtra NPAB 



NF/a Spl 

CAAGGGACTTTCCC, CTGGGGACTTTCCAG« GGAGGCGTGGC d46l 

GGTGACTAGTT2CG# GTGG GGACTTTCCA A, GAAGGCGCGGC 

CT GGGGACTTTCCG AAQAGGCGrGA CCGGACyTlCCA A. GGCGACGTGGC 
** *T* ** * ** * 

T TGGGACTTTCCG AAGAGGCATGA AQGQj^TTTCC AAS. 6CAGGCCTGGC 
* ♦ *♦♦***♦*♦♦♦** *♦♦•♦ «** 

GTGGCCTGAGTGACTAA GTTCCG TT GGGGACTTTCCA A^ AAAGGCGAGGC 
♦♦*♦*•♦*♦•* «*♦*♦«♦♦♦•»*♦♦ ******* 

GTTGGGAST TTCCA AGGAGGC , , GCGGGGACTTTCCAA, GGAGGCGCGGC 

Clt« 4 CUM NP/O NTkB 5?1 

and C98H 3' -half NncB 






Spl Spl TATA box 

CTGG6CGGGACTGGG0AGTGGCGAGCCC , TCAGATGCTGCATATAAGCAG 9S 1 0 

CTGGGCGGGACTGG6GAGTGGCGA GCCC , TCASATGCTGCATATAAGCAG 
♦ •****«*****73TT»Tr3r**T* 

CTGGGCGGGACTG6GGAGTGGCGA 6CCC, TCAGATGCTGC ATATAAG CAG 

CTGgGCGGGACllgGGGAGTG GCGAfirrC, tCAGATGCTCC ATATAA CCAG 

***** **************** **** 

CTGGGCGGA-CTGGGGAGTGC-GA GCC- , TCAGATGCYGC ATATAG GCAG 

C TGGGCGGGAC T GGGGAGGGGCGA GCCC, TCAGATGCTGC ATATAA GCAG 
Spl Spl 



U3 R 

J I T AR 

CTGCTTTTTGCCTGTACTGGGTCTCTCTGGTTAGACCAGATCTGAGCCTG 9.S<V0 

CTGCTTTCTGCTGTTACTGOGa'CTCTCGGGTTAGACCAiSATCTGAGCCTG 

****** *«****«********«**4****«**<******4********** 

CTGCTTTCTGCCTGTACTGGGTCrCTCTGGTTAGACCAGATCTGAGCCTG 
****** ******«********««**«*♦«♦**«** A************* 

CTGCtTTCTGCCTGTACTGGGTCTCTCTGGTTAGACCAGATCTGAGCCTC 
******* **********•********************>***********'* 

CTGCTTTCTGCCTGTACTGGGTCTCTCTGGTTAGACCAGATCTGAGCCTG 

*«**«*« «*************4*************«**«'*******«**^ 

CTGCTTTCTGCCTGTACTGGGtCTCTCTGGTTAGACCAGATCTGAGCCT . . incomplete



Spl 



TATA box 



D36P 
C18S 
C18M 
C98H
C54R2 



CTGGGCGGGACTGGGGAGTGGCGAGCCC , TCAGATGCTGCATATAAGCAG 
C T6GGCGGGACTGGGGAGTGGCGA GCCC/ TCAGATGCTGCATATAAGCAG 
CTGGQCGGGACTGGGGAGTGGCG AGCCC/ TCAGATGCTGC ATATAAG CAG 
CTGGQCGGGACTGGGGAGTGGCG RGrCC, TCA6ATGCTGC ATATAAC CAC 

4r«****«*«4r ***** **************** ♦*♦* 

CTGGGCOGA-CTGGGGAGTGC- GAGCC- , TCAGATGCTGC ATATAG GCAG 
C TGGGCGGGAC T GGGGAGCGGCGA GCCC, TCAGATGCTGC ATATAA GCAG 



9510 



Spl 



U3 R 

1 [ t AR 
NL43 CTGCTTTTT6CCTGTACT60GTCTCTCTCGTTAGACCAGATCXGAGCCTG ^f»M 

•****«« «** ************* •••♦*•*****♦**♦♦**♦*** 

D3 6P CTCCTTTCTGCTGTTACTGCCTCTCTCGOGTTAGACCAGATCTGAGCCTG 
***«*« *«***«« ******«***•**«««***** «*****««« 

CI 8S CTGCTTTCTGCCTGTACTOaGTCTCTCTGGTTAfiACCAGATCTSACCCTG 

**♦♦♦♦*♦*♦«*«♦*♦*♦♦♦♦♦*♦*♦*****♦**•******♦♦ 

C18M CTGCTTTCTCCCTGTACTG6GTCTCTCTGGTTAGACCAGATCTGAGCCTC 
******* ****************************************** 

C9 8H CTGCTTTCTGCCTGTACTGGGTCTCTCTGGTTAGACCAGATCT6AGCCT6 
******* ****************************************** 
C5 4H2 CTGCTTTCTGCCTGTACTGGGTCTCTCTGGTTASACCAGATCTGAGCCT . . iiveompl*fe* 









AACTAG^GATCCCTCAGACCCTTTTAGTCAGTGTGGAAAATCTCTAGCA 9709 
ATCTAGA 1305 
ATCTAGA 1209 

ATCTAGAGATCCCTCAGACCATTTTAGTCCGTGTGGAAAATCTCTAGCA END 
* ***♦» 

ATCTAGA 1399 



FIGURE 2 



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TGGAAGGGCTAATTCACTCACGGAAAAGACCAGTTGAACCAG 

AAGAAGATAGAAGAGGCCATGAAGAAGAAAAGAACAGATTGT 

TCTGCTTGCTCAGCTGGGGACTTTCCAGAAGGCGCGGCCTGA

GTGACTAAGCCCCGTTGGGGACTTTCCGAAGAGGCATGAAGG

GACTTTCCAAGGCAGGCGTGGCCTGGGCGGGACTGGGGAGTG

GCGAGCCCTCAGATGCTGCATATAAGCAGCTGCTTTCTGCCT

GTACTGGGTCTCTCTGGTTAGACCAGATCTGAGCCTGGGAGC

TCTCTGGCTAGCTAGGGAACCCACTGCTTAAGCCTCAATAAA

GCTTGCCTTGAGTGCTTCAAGTAGTGTGTGCCCGTCTGTTGT 

GTGACTCTGGTATCTAGAGATCCCTCAGACCATTTTAGTCCG 

TGTGGAAAATCTCTAGCAGTGGCGCCCGAACAGGGACTTGAA 

AGCGAAAGGAAAACCAGAGGAGCTCTCTCGACGCAGGACTCG 

GCTTGCTGAAGCGCGCACGGCAAGAGGCGAGGGGCGGCGACT 

GGTGAGTACGCCGAAAATTTTGACTAGCGGAGGCTAGAAGGA 

GAGAGATGGGTGCGAGAGCGTCAATATTAAGCGGGGGAAAAT 

TAGATAGATGGGAGAAAATTCGGTTAAGGCCAGGAG6AAAGA 

AAAAGTATAAATTAAAACATAT?AGTATGGGCAAGCAGGGAGC 

TAGAACGATTCGCAGTCAATCCTGGCCTGTTGOAAACATCAG 

AAGGCTGTAGACAAATACTGGGACAGTTACACCCaTCCCTTC 

AGACAGGATCAGAAGAACTTAAATCAGTATATAATGCAGTAG 

CAGTCCTCTATTGTGTGCATCAAAACATAaACATAAAGGACA 

CCAAGGAAGCTTTAGAAAAGATAGAGGTU^GAGCAAAACAAAT 

GTAAGAAAAAAGCACAGCAAGCAGCAGCACA6CAAGCAGCAG 

CTGGCACAGGAAACAGCAACCCGGTCAGCCAAAATTACCCTA 

TAGTACAGAACATGCAGGGGCAAATGGTACATCAGGCCATAT 

CACCTAGAACTTTAAATGCATGGGTAAAAGTAATAGAAGAGA 

AGGCTTTCAGCCCAGAGGTAATACCCATGTTTTCAGCATTAT 

CAGAAGGAGCCACCCCACAAGATTTAAACACCATGCTAAACA 

CAGTGGGGGGACATCAAGCAGCTATGCAAATaTTAAAAGAGA 

CCATCAATGAGGAAGCTGCAGAATGGGATAGATTACATCCAG 

CGCAGGCAGGGCCTGTTGCACCAGGCCAGATGAGAGACCCAA 

GGGGAAGTGAaiTAGCAGGAACTACTAGTACCCTTCAGGAAC 

AAATAGGATGGATGACAGGTAATCCAGCTATCCCAGTAGGAG 

AAATCTATAAAAGATGGATAATCCTGGGATTAAATAAAATAG 

TAAGGATGTATAGCCCTATCAGCATTCTGGACATAAAACAAG 

GACa^AAGGAACCCTTTAGAGACTATGTAGACOGGTTCTATA 

AAACTCTAAGAGCCGAGCAAGCTACACAGGAGGTAAAAAATT 

GGATGACAGAAACCTTGTTGGTCCAAAATGCAAACCCAGATT 

GTAAGACTATTTTAAAAGCATTGGGACCAGCAGCTACACTAG 
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AAGAAATGATGACAGCATGTCAGGGAGTGGGAGGACCCAGCC 

ATAAAGCAAGAGTTTTGGCAGAAGCAATGAGCCAAGCAACAA 

ATGCAGCTACTGTAATGATGCAGAGAAGCAATTTTAGAAACC 

AAAGAAAGAATGTTAAGTGTTTCAATTGTGGCAAAGAAGGGC 

ACATAGCCAGAAATTGCAGGGCTCCTAGGAAAAGGGGCTGTT 

G6AAATGTGGAAAGGAAGGACACCAAATGAAA6ATTGTACTG 

AGAGACAGGCTAATTTTTTAGGGAAAATCTGGCCTTCCCACA 

AGGGGAGGCCAGGGAACTTTCTTCAGAGCAGGCCAGAACCAA 

CAGCCCCTCTCCAGGGCAGGCCGGAGCCATCAGCCCCGCCAG 

AAGAGAGCTTCAGGTTTGGGGAGGAGACAACAACTCCCTCTC 

AGAAGCAGGAGCCGATAGACAGGGACAGGGATCTOTATCCTT 

TAGCTTCCCTCAGATCACTCITTGGCAACXJACCCCTCGTCAC 

T^fAAAGATAGGGGGGCAGCTGAAGGAAGCTCTATTAGATAC 

AGGAGCAGATGATACAGTATTAGAAGACATGCATTTGCCAGG 

AAAATGGAAACCAAAAATGATAGGGGGAATTGGAGGTTTTAT 

CAAAGTAAAACAATATGATGAAATTCTTGTAGAAATCTGTGG 

ACATAAAGCTATAGGTACAGTATTAGTAGGACCTACACCTGT 

CAACATAATTGGAAGAAATCTGTTGACTCAGATTGGTTGCAC 

TTTAAATTTTCCCATTAGTCCTATTGAAACTGTACCAGTACA 

ATTAAAGCCAGGAATGGATGGCCCAAAGGTTAAACAATGGCC 

ATTGACAGAAGAGAAAATAAAAGCATTAGTAGUUULTTTGT^ 

AGAAATGGAAAAGG/^GGAAAGATirrCAAAAATTGGGCCTGA 

AAATCCATACAATACTCCAGTATTTGCCATAAAGAAAAAAGA 

TGGTACTAAATGGAGAAAATTAGTAGATTTCAGAGACCTTAA 

TAAGAGAACTCAAGACTTCTGGGAAGTTCAATTAGGAATACC 

ACATCCCTCAGGATTAAAAAAGAAAAAATCAGTAACAGTACT 

GGATGTGGGTGATGCATACTTTTCAGTTCCCTTAGATGAAAA 

CTTCAGGAAGTATACTGCATT^ACCATACCTAGTATAAATAA 

TGAGACACCAGGGATTAGATATCAOTACAATGTGCTTCCACA 

GGGATGGAAAGGATCACCAGCAATATTCCAAAGTAGCATGAC 

AAGAATCTOAGAGCCTTTTAGAAGACAAAATCCAGACATAGT 

TATCTATO^ATAO^TGGATGACriTOTATGTAGaATCTGATTT 

AGAAATAGGACAGCATAGAATAAAAATAGAGGAACTGAGACA 

ACATCTGTTGAAGTGGGGATTTACCACACCAGACAAAAAGCA 

TCAGAAAGAACCCCCATTCCTTTGGATGGGTTATGAACrCC^ 

TCCTGATAAATGGACAGTGCAACCTATAGTACTGCCAGAAAA 

AGA(^GCTGGACTOTa^TGACATACAGAAGTTAGTGGGTAA 

ATTAAATTGGGCAAGTCAGATTTACCCAQGAATTAAAGTAAG 

GCAATTATGTAAACTCCTTAGGGGAACCAAAGCACTAACAGA 
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AGTAATACCACTAACAGAAGAAGCAGAGCTAGAACTGGCAGA 

AAACAGGGAAATTCTAAGAGAACCAGTACATGGAGTGTATTA 

TGACCCATCAAAAGACTTAATAGCAGAAATACAGAAGCAGGA 

GCyVAGGCCAATGGACATATCAAATTTATCAAGATCAATTTAA 

AAATCTAAAAACAGGA7UVGTATGCAAGATTGAGGGGTGCCCA 

CACTAATGATGTAAAACAATTTCCAGAGGCAGTGCAAAAAAT 

AGCCACAGAAAGCATAGTAATATGGGGAAAGACTCCTAAATT 

TAGACTACCCATACAAAAAGAAACATGGGACGCATGGTGGAC 

AGAGTATTGGCAAGCCACCTGGATTCCTGAGTGGGAGTTTGT 

CAATACCCCTCCCCTAGTAAAATTATGGTACCAGTTAGAAAA 

AGAACCCATAATAGGAGCAGAAACTTTCTATGTAGATGGGGC 

AGCTAACAGAGAGACTAAATTAGGAAAAGCAGGATATGTTAC 

TGACAGAGGAAGACAAAAAGTTGTCTCCCTAACTGACACAAC 

AAATCAGAAGACTGAGTTACAAGCAATTCATCrAGCrTTGCA 

GQATTCAGGATTAGAAGTAAACATAGTAACAGACTCACAGTA 

TGCATTAGGAATCATTCAAGCACAACCAGATAAAAGTGAATC 

AGAAATAGTCAATCAAATAATAGAGCAATTAATAAAAAAGGA 

AAAGGTCTACCTGGCATGGGTACCAGCACACAAAGGAATTGG 

AGGGAATGAACAAGTAGATAAATTAGTCAGTGCTGGAATCAG 

GAAAATACTATTTTTAGATGGAATAGATAAGGCACAAGAAGG 

CCATGAGAAATATCAO^GTAATTGGAGAGCAATGGCTAGTGG 

TTTTAACCTGC»CCTATAGTAGCAAAAC3AAATAGTAGCCAG 

CTGTGATAAATGTCAGCTAAAAGGAGAAGCCATGCATGGACA 

AGTAGACTGTAGTCCAGGAATATGGCAACTAGATTGTACACA 

TCTAGAAGGAAAAATTATCCTGGTAGCAGTTCATGTAGCCAG 

TGGATATATAGAAGCAGAAaTTATTCCAGCAGAGACAGGGCA 

GGAAACAGCATACTTTATCTTAAAATTAGCAGG AAGG TGGCC 

AGTAAACACAATACATACAGACAATGGCGGCAATTTCATCAG 

TACCACGGTTAAGGCCGCCTGTTGGTGGGCAGGGATCAAGCA 

GGAATTTGGCATTCCCTACAATCCCCAAAGCCAAGGAGTAGT 

GGAATCTATGAATAGAGAATTAAAGAAAATTATAGOACAGGT 

AA6AGATCAGGCTGAACATCTTAAGACAGCAGTACAAATGGC 

AGTATTCATCCACAATTTTAAAAGAAAAGGGGGGATTGGGGG 

ATACAGTGCAGGGGAAAGAATAGTAGACATAATAGCAACAGA 

CATACAAACTAAAQAATTACA7UAGCAAATTACAAAA ATTCA 

AAATTTTaXSGTTTATTACAGGGACAGCAGAGATCCACTTTG 

GAAAGGACCAGCAAAACTTCTCTGGAAAGGCaAAGGGGCAGT 

AGTAATACAAGATAATAGTGACATAAAAGTAGTOCCAAGAAG 

AAAAGTAAAGATCATTAGGGATTATGGAAAACAGATGGCAGG 
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TGATGATTGTGTGGCAAGTAGACAGGATGAGGATTAGAACAT 

GGAACAGTTTAGTGAAACACCATATGTATGTTTCAAAGAAAG 

CTAAGGGATGGATTTATAGACATCACTATGAAAACACTCATC 

CAAAAATAAGCTCAGAAGTACACATCCCACTAGGGGAAGCTA 

GATTGGTAATAACAACATATTGGGGTCTACATACAGGAGAAA 

GAGACTGGCATTTGGGTCAGGGAGTCTCCATAGAATGGAGGG 

AAAQGACATATAGAACACAAGTAGACC CCGAAC TAGCAGACC 

AACTAATTCATATACATTACTTTGATTGTTTTTCAGAATCTG 

CCATAAGAAGTGCCATATTAGGATATAGAGTTAGGCATAGGT 

GTGAATATCAAGCAGGACATAACAAGGTAGGATCTCTACAGT 

ACTTGGCACTAACAGCATTAATAACACCAAAGAAGATAAAGC 

CACCTTTGCCTAGTGTTGCGAAACTGACAGAGGATAGATGGA 

ACAAGCCCCAGAAGACCAAGGGCCACAGAGGCAGCCATACAA 

TGAATGGACTVCTAGAACTTTTAGAGGAGCTTAAGAATGAAGC 

TGTTAGGCATTTTCCTAGGGTATGGCTCCATGGCTTAGGGCA 

ACATATCTATQAAACnTATGQGGATACTTQGGAAGGAGTGGA 

GGCCT^TAACAAGAACTCroa^CAACIHSCTGTTTACT 

CAG7UVTTGGGTGTCAACATAGCAGAATAGGCATTATTCGACA 

GAGGAGAGCAAGAAATGGAGCCAGTAGATCCTAGACTAGAGC 

CCTGGAAGCATCCAGGAAGTCAGCCTAAGACTGCGTGTACCA 

CTTGCTATTGTAAAAAGTGCTGCrrTCATTGCCAAGTTTGOT 

TTATGAOVAAAGGCTTAGGCATCTCCTATGGCAGGAAGAAGC 

GGAGACAGCGACGAAGAGCTCCTCAAGACAGTCAGACTCATC 

AA6CTTATCTATCAAAGCAGTAAGTAATATATGTAATGCAAC 

CTTTACAAATAGTAGCAATAGTAGCATTAGTAGTAGCAGGAA 

TAATAGCAATAGTTGTGTGGACCATAGTATTCATAGAATATA 

AGAAAATATTAAGACAAAGAAAAATAGACAGGTTGATTGATA 

GAATAAGAGAAAGAGCAGAAG^CAGTGGCAATGACAGTGAAG 

GGGATCAGGAAGAATTATCGGCACTTGTGGACATGGGGCACC 

ATGATCCTTGGGATATTAATGATCTGTAGAQCTGCAAACAAT 

TTGTGGGTCACAGTCTATTATGGGGTACCTGTGTGGAGAGAA 

GCAACCACCACTCTATTTTGTGCATCAGATGCCAAGGCATAT 

GATGCAGAGGTACATAATGTTTGOGCCACACATGCCTGTGTA 

CCCACAGACCCTAACCCACAAGAAGTAGAATTGAAAAATGTG 

ACAGAAAATTTTAACATGTGGAAAAATAACATGGTAGAACAG 

ATGCATQAGGATATAATCAGTTTATGGGATCAAAGCCTGAAG 

CCATGTGTAAAATTAACCCCACTCTGTGTTTCTTTAAATTGC 

ACTGATGCTACTAATACCACTAATAGTAATACCACTAGCAGC 

AGCGAGAAACCGAAGGGGACAGGGGAAATAAAAAACTGCTCT 
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TTCAATATCACCACAAGCATAAGAGATAAGGTGCAG/^CAA 

TATGaVCTTTTTTATAGCCTTGATCTAGTACCAATGGATGAT 

AATGATAATAGTACAAGCTATAGGTTAATAAGTTGTAACACC 

TCAATCATTACACAGGCCTGTCCAAAGA TATCC TTTGAGCCA 

ATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTA 

AAG1X3TAAAGATAAAAGGTTCAATGGAAAAGGACCATGTACA 

AGTGTCAGCACAGTACAGTGTACACATGGAATTAGGCCAGTA 

GTATCAACTCAACTGTTGTTAAATGGCAGTCTAGCAGAAGAA 

GAGGTAGTAATTAGATCTGAO^ATTTTACGAACAATGCTAAA 

ACCATAATAGTACAGCTCAGCAAATCTGTAGAAATTACTTGT 

GTAAGACCCAACAACAATACAAGAAAAAGTATAAQTATGGGA 

CCAGGGAGAGCATTTTATACAACAGAAATAATAGGAGATATA 

AGAO^GCATATTGTAACATTAGTAAAGCAAACTGGACTGAC 

ACTTTAGAACAGATAGCTAGAAAATTAAGAGAACAATTTGAG 

AATAAAACAATAGTCTTTAAGCCATCCTCAGGAGGGGACCCA 

GAAATTGTAACACAGTTTTACAGTTTTAATTGTGGAGGGGAA 

TTTTTCTACTGTAATTCTUlCyVCAACTGTTTAATGGTACOT 

AATGGTACTTGGGTTAATGGTACTTGGAGTAGTAATAATACG 

ACTGATACTGOU^TATCACACTCCCaTGauSAATAAAACAA 

TTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCC 

CCTCCCATCAAAGGACAAATTAAATGTACATCAAATATTACA 

GGGCTGATATTAACAAGAGATGGTGGTAAGAATAACACCACG 

AAOSACAACGAGACCGAGACCrrTCAGACCTGGAGGAGGAGAT 

ATGAGGGACAATTGGAGAAGTGAATTATATAAATATAAAGTA 

GTACAAGTTGAACCATTAGGAGTAGCACCCACCAAGGCAAAG 

AGAAGAGTGGTGCAAAGAGAAAAAAGAGCAGTGGGTU^TAGGA 

GCTATGTTCCTTGGGTTCLTAGGAGCAGa^GGAAGCACTATG 

GGCGCAGCGTCAGTGACGCTGACGGTACAAGCCAGACAATTA 

TTGTCTGGTATAGTGCAGCAGCAGAACAATCTGCTGAGGGCT 

ATTGAGGCGCAACAGCATCTGTTGCaVACrra^CAGTC^^ 

ATCAAACAGCTCCAGGCAAGAGTCCTGGCrGTGGAAAGATAC 

CTAAGGGATOUiaiGCTCCTGGGACrTTGGGGTTGCT 

AAACTCATTTGCACCACTACTGTGCCTTaGAACAATAGCrrGG 

AGTAATAAATCrr(n:GGAAACAATTTGGGATAACATGACCTGG 

ATGCAGTGGGAAAGAGAAATTGACAATTACACAAACATAATA 

TACACCTTAATTGAAGAATCGCAGAACCAACAAGAAAAAAAT 

GAACTAGAATTATTGGAATTGGATAAATGGGCAAATTTGTGG 

AATTGGTTTAGTATATCAAACTGGCTATaOTATATAAAATTA 

TTCATAATGGTAGTAGGAGGCTTGGTAGGTTTAAGAATAGTT 
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TTTACTGTACTTTCTATAGTTAATAGAGTTAGGCAGGGATAC 

TCACCATTATCGTTTCAGACCCACCTCCCAACCCCGAAGGGA 

CCCGACAGGCCAGAAGGAATCGAAGAAGAAGGTGGAGAGAGA 

GACAGAGGCAGCTCCACTCGATTAGTGCACGGATTCTTAGCA 

CTTTTCTGGGACGACCTGAGGAGTCTGTGCCTCTTCAGCTAC 

CACCACTTGAGAGACTTACTCTTGATTGTAACGAGGATTGTG

GAACTTCTGGGACGCAGGGGATGGGAAGCCCTCAAATACTGG 

TGGAATCTCCTGCAGTATTGGAGGCAGGAACTACAGAAGAGT 

GCTGTTAGCTTGTTCAATGGCACGGCCATAGCAG TAG CTGAG 

GGGACAGATAGAGTTATAGAAGCTTTACGAAGGGCTTATAGA 

GCTATTCTCCACATACCTAGAAGAATAAGACAGGGCTTAGAA 

AGGGCTTTGCTATAAAATGGGTGGCAAGTGGTCAGAAAGTAG

TGTGGTTAGAAGGCATGTACCTTTAAGACAAGGCAGCTATAG 

ATCTTAGCCGCTTTTTAAAAGAAAAGGGGGGACTGGAAGGGC 

TAATTCACTCACGGAAAAGACCAGTTGAACCAGAAGAAGATA 

GAAGAGGCCATGAAGAAGAAAACAACAGATTGTTCTGCTTGC 

TCAGCTGGGGACTTTCCAGAAGGCGCGGCCTGAGTGACTAAG

CCCCGTTGGGGACTTTCCGAAGAGGCATGAAGGGACTTTCCA

AGGCAGGCGTGGCCTGGGCGGGACTGGGGAGTGGCGAGCCCT

CAGATGCTGCATATAAGCAGCTGCTTTCTGCCTGTACTGGGT

CTCTCTGGTTAGACCAGATCTGAGCCTGGGAGCTCTCTGGCT

AGCTAGGGAACCCACTGCTTAAGCCTCAATAAAGCTTGCCTT

GAGTGCTTCAAGTAGTGTGTGCCCGTCTGTTGTGTGACTCTG

GTATCTAGAGATCCCTCAGACCATTTTAGTCCGTGTGGAAAA 

TCTCTAGCA 
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